09346470Results2 



SEQ ID NO: 10 

ALIGNMENTS 

RESULT 1 
097095 

ID 097095 PRELIMINARY; PRT; 541 AA. 

AC 097095; 

DT 01-MAY-1999 (TrEMBLrel . 10, Created) 

DT 01-MAY-1999 (TrEMBLrel. 10, Last sequence update) 

DT 01-JUN-2001 (TrEMBLrel. 17, Last annotation update) 

DE ECDYSONE RECEPTOR. 

OS Locusta migratoria (Migratory locust) . 

OC Eukaryota; Metazoa; Arthropoda; Tracheata; Hexapoda; Insecta; 

OC Pterygota; Neoptera; Orthopteroidea; Orthoptera; Caelifera; 

OC Acridomorpha; Ac ri do idea; Acrididae; Locusta. 

OX NCBI_TaxID=7004; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=FAT BODY; 

RA Saleh D.S., Zhang J., Wyatt G.R., Walker V.K.; 

RT "Cloning and characterization of an ecdysone receptor cDNA from 

RT Locusta migratoria."; 

RL Submitted (FEB-1998) to the EMBL/GenBank/DDBJ databases. 

CC -!- SUBCELLULAR LOCATION: NUCLEAR (BY SIMILARITY). 

CC -!- SIMILARITY: TO C4 -TYPE STEROID RECEPTOR ZINC FINGER FAMILY. 

DR EMBL ; AF04 9136; AAD1982 8.1; 

DR HSSP; P203 93; 1A6Y . 

DR Inter Pro; IPR000536; Hormone__rec_lig . 

DR InterPro; I PRO 01 72 3; Strdhormone_receptor . 

DR InterPro; IPR001628; zf-C4 . 

DR Pfam; PF00104; hormone_rec; 1. 

DR Pfam; PF00105; zf-C4; 1. 

DR PRINTS; PR00398; STRDHORMONER . 

DR PRINTS; PRO 0 04 7; STROIDFINGER . 

DR SMART; SM00430; HOLI; 1. 

DR SMART ; SM00399; ZnF__C4 ; 1. 

DR PROSITE; PS00031; NUCLEAR_RECEPTOR; 1. 

KW DNA-binding; Nuclear protein; Receptor; Transcription regulation; 

KW Zinc -finger. 

SQ SEQUENCE 541 AA; 59197 MW; 5C7A5ABD537F89CB CRC64; 

Query Match 51.2%; Score 1613.5; DB 5; Length 541; 

Best Local Similarity 57.3%; Pred. No. 4.3e-112; 

Matches 341; Conservative 65; Mismatches 112; Indels 77; Gaps 

Qy 23 GG VGGLMS YNRGRGGT E VI I K PR S P A WQ V - - ATGG S YHGL P A -AS DA V I VRS P P 74 

I I I I ! : II II I I I I I : I | : : | : : | 

Db 6 GADGALPSASASASASASGAPAASPLAVSVPLALPLPGHASPASAADALWKTEPREAGA 65 

Qy 75 GGHLPGPQQQVPPSRNGCSTLFSDIAGVKRLRPD-DWL-AVNSPPASSPGTSH 125 

I IN M I I III : I I II | 

Db 66 LFAAISSPGQGPGP AKRARLD S DWL S S PG SNAA P S P P PHH 105 

Qy 126 ISYTVISNGGGGGGGGGGGYNTSPMST-NSYDPYSPMSGKIVKEELSPPNSLSGVSSHSD 184 

I I II Ihh lllllll II :, :| >: 

Db 106 LFGAAASASAGAPAALPNGY-ASPLSSGGSYDPYSP-GGKIGREDLSPLSSLNGYSADSC 163 

Qy 185 G L KKKKLNHT P S TGWNT S AS G PGGG VGGNVLNNR P P E E LC L VCGD RS S G YH YNALTC EG 244 

nh i,. niUi : 1 1 IN I Ml I IN I III I II II II 

UiD 164 DAKKKK GAAP RQQEELCLVCGDRASGYHYNALTCEG 199 

Qy 245 CKGFFRRSITKNAVYQCKYGNNCEIDMYMRRKCQECRLKKCLTVGMRPECWPEVQCAVK 304 

MMIMIIMMIIIIIIIIIIIIIMIIIIIIIIIMIIIMIIMIIMII lllll 



Db 


200 


CKGFFRRSITKNAVYQCKYGNNCEIDMYMRRKCQECRLKKCLTVGMRPECVVPEYQCAVK 


259 


Qy 


305 


RKEKKAQREKDKPNSTTDISPEIIKIEPTEMKIECGEPMIMGTPMPTVPYVKPLSSEQKE 


364 


Db 


260 


Iltlll|::||||||||: 111 = = = = = l = i 1= 1 : li = l 11 = 1 
RKEKKAQKDKDKPNSTTNGSPEVMMLKDIDAKVEPERPLSNG IKPVSPEQEE 


311 


Qy 


365 


LIHRLVYFQDQYEAPSEKDMKRLTINNQNMDEYDEEKQSDTTYRIITEMTILTVQLIVEF 

: | I 1 IMM Ill 

LIHRLVYFQNEYESPSEEDLRRVT SQPTEGEDQSDVRFRHITEITILTVQLIVEF 


424 


Db 


312 


366 


Qy 


425 


AKRLPGFDKLVREDQITLLKACSSEAMMFRVARKYDITTDSIVFANNQPFSADSYNKAGL 


484 


Db 


367 


MIIIMMhlllM llllllll 1111 = 11 = 11= =111 = 111111 = = INI 11 = 

AKRL PG FD KLLRE D Q I ALL KAC S S E VMM F RMARR YD VN S D S I L F ANNQ P YT KD S YNLAGM 


426 


Qy 


485 


GDAIENQLSFSRFMYNMKVDNAEYALLTAIVIFSSRPNLLDGWKVEKIQEIYLESLKAYV 

: ||: | | | || IIIIIIIIMIIMIMI lhh = IIIIIIIIMIIhlllll 
GETIEDMLRFCRQMYAMKVDNAEYALLTAIVIFSERPSLVEGWKVEKIQEIYLEALKAYV 


544 


Db 


427 


486 


Qy 


545 


DNRDRDTATVRYARLLSVLTELRTLGNENSELCMTLKLKNRWPPFLAEIWDVMP 599 

III I : :| = MIIIIIIIIIII = II|:| =11111= =1111111111 = 1 
DNRRRPKSGTIFAKLLSVLTELRTLGNQNSEMCFSLKLKNKKLPPFLAEIWDVIP 541 




Db 


487 





RESULT 2 
002035 

ID 002035 PRELIMINARY; PRT; 4 91 AA. 

AC 002 03 5; 

DT 01-JUL-1997 (TrEMBLrel . 04, Created) 

DT 01-JUL-1997 {TrEMBLrel. 04, Last sequence update) 

DT 01-JUN-2001 (TrEMBLrel. 17, Last annotation update) 

DE ECDYSONE RECEPTOR. 

GN ECR . 

OS Tenebrio molitor (Yellow mealworm) . 

OC Eukaryota; Metazoa; Arthropoda; Tracheata; Hexapoda; Insecta; 

OC Pterygota; Neoptera; Endopterygota; Coleoptera; Polyphaga; 

OC Cucuj if ormia; Tenebrionidae; Tenebrio. 

OX NCBI_TaxID=7067; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=98000286 ; PubMed=934223 9 ; 

RA Mouillet J.F., Delbecque J. P., Quennedey B., Delachambre J. ; 

RT "Cloning of two putative ecdysteroid receptor isoforms from Tenebrio 

RT molitor and their developmental expression in the epidermis during 

RT metamorphosis . " ; 

RL Eur. J. Biochem. 248:856-863(1997). 

CC -■- SUBCELLULAR LOCATION: NUCLEAR (BY SIMILARITY). 

CC -!- SIMILARITY: TO C4-TYPE STEROID RECEPTOR ZINC FINGER FAMILY. 

DR EMBL; Y11533; CAA722 96.1; -. 

DR HSSP; P20393; 1A6Y. 

DR InterPro; IPR000536; Hormone_rec_lig . 

DR InterPro; IPR0 005 04; RRM. 

DR InterPro; I PRO 01 72 3; Strdhormone_receptor . 

DR InterPro; IPR001628; zf-C4. 

DR Pfam; PF00104; hormone_rec; 1. 

DR Pfam; PF00105; zf-C4; 1. 

DR PRINTS; PR00398; STRDHORMONER . 

DR PRINTS; PR00047; STROIDFINGER . 

DR SMART; SM00430; HOLI ; 1. 

DR SMART; SM003 99; ZnF_C4 ; 1. 

DR PROSITE; PS00031; NUCLEAR_RECEPTOR; 1. 

DR PROSITE; PS00030; RRM_RNP_1 ; UNKNOWN_l . 

KW DNA-binding; Nuclear protein; Receptor; Transcription regulation; 

KW Zinc- finger. 

SQ SEQUENCE 491 AA; 56096 MW; F997E90A56A789D1 CRC64 ; 



Query Match 4 5.5%; Score 14 34.5; DB 5; Length 4 91; 

Best Local Similarity 65.8%; Pred. No. 8.4e-99; 

Matches 293; Conservative 48; Mismatches 51; Indels 53; Gaps 



Qy 159 SPMSGKIVKEELSPPNSLSGVSSHSDGLKKKKLNHTPSTGWNTSASGPGGGVGGNVLNN 218 



I III : 1 : 1 1 1 : 1 1 : 1 I : I MM 1 1 

Db 90 SSMSG REDLS-PSSLNGYSADSCDSKKKK GP TP 121 

Ov 219 RPPEELCLVCGDRSSGYHYNALTCEGCKGFFRRSITKNAVYQCKYGNNCEIDMYMRRKCQ 278 

| M | | | | | | | | : | | | | I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I i I i I I I 
Db 122 RQQEELCLVCGDRASGYHYNALTCEGCKGFFRRSITKNAVYQCKYGNNCEIDMYMRRKCQ 181 

Ov 279 E C RL KKC LT VGMR P E C WP E VQC AVKRKE KKAQRE KD K PN S TTD ISPEIIKIEP TE 334 

| | | | | | | I : I I I I I I I I I I I I I I I I I I I I I I I I 0 I I I I I I I I M h = i I I I I =1 
Db 182 ECRLKKCLSVGMRPECWPEVQCAVKRKEKKAQKEKDKPNSTTNGSPDVIKIEPELSDSE 241 

Ov 335 MKIECGEPMIMGTPMPTVPYVKPLSSEQKE- -LIHRLVYFQDQYEAPSEKDMKRLTINNQ 392 

: | | | I M I I I M I I I h : I I I I 1 : h I I I II 

Db 242 KTLTNGRNRI SPEQEELILIHRLVYFQNEYEHPSEEDVKR- - IINQ 285 

Ov 393 NMD E YD EEKQSDTTYRIITE MT I LT VQLI VE F AKRLPG FD KL VRE DQ I T LL KACS S E AMM 452 

: |||: :| || = | | II M M II I M I I I I M I I : M I I I M II I i I I I I 
Db 286 PI DGEDQCE I RFRHTTE I T I LTVQLI VEFAKRLPGFDKLLQEDQI ALLKACS SEVMM 342 

Ov 453 FRVARKYDITTDSIVFANNQPFSADSYNKAGLGDAIENQLSFSRFMYNMKVUNAEYALLT 512 

||:||:||: HI 1 = 1 lllh I I I I I I : I : I I : I I I M M I II II I 1 I I M 
Db 343 FRMARRYDVQSDS I LFVNNQPYPRDS YNLAGMGET I EDLLHFCRTMYSMKVDNAE YALLT 402 

OV 513 AIVIFSSR PNLLDGWKVE K I Q E I YLE S L KA YVDNRDRD T AT VR YARL L S VLT E LRTLGNE 572 

llllll Il:|::| Mill II IIMhhl Mill = = I = I I II I I I I I I I I I = 

Db 403 AIVIFSERPSLIEGWKVEKIQEIYLEALRAYVDNRRSPSRGTIFAKLLSVLTELRTLGNQ 462 

Qy 573 NSELCMTLKLKNRWPPFLAEIWDV 597 

I I I : I : = I I I I I = : I M I Mill 
Db 4 63 N S EMC I S LKLKNKKL P P F LD E I WD V 4 87 



RESULT 3 
044337 

ID 04433 7 PRELIMINARY; PRT; 560 AA. 

AC 044337; 

DT 01-JUN-1998 (TrEMBLrel . 06, Created) 

DT 01-JUN-1998 (TrEMBLrel. 06, Last sequence update) 

DT 01-JUN-2001 (TrEMBLrel. 17, Last annotation update) 

DE AAMECRA1 . 

GN AAMECRA1 . 

OS Amblyomma americanum. 

OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; 

OC Parasitiformes; Ixodida; Ixodidae; Amblyomma. 

OX NCBI_TaxID=6943 ; 

RN [1] 

RP SEQUENCE FROM N . A. 

RA Guo X., Harmon M.A. , Laudet V., Mangelsdorf D.J., Palmer M.J.; 

RL Insect Biochem. Mol . Biol. 0:0-0(1997). 

CC -!- SUBCELLULAR LOCATION: NUCLEAR (BY SIMILARITY). 

CC -!- SIMILARITY: TO C4-TYPE STEROID RECEPTOR ZINC FINGER FAMILY. 

DR EMBL; AF020187; AAB94566.1; -. 

DR HSSP; P20393; 1A6Y . 

DR Inter Pro; I PRO 0053 6; Hormone_rec_lig . 

DR InterPro; IPR001723; Strdhormone_receptor . 

DR InterPro; IPR001628; zf-C4. 

DR Pfam; PF00104; hormone_rec; 1. 

DR Pfam; PF00105; zf-C4; 1. 

DR PRINTS; PR00398; STRDHORMONER . 

DR PRINTS; PR0004 7; STROIDFINGER . 

DR SMART; SM00430; HOLI ; 1. 

DR SMART; SM00399; ZnF_C4 ; 1. 

DR PROSITE; PS00031; NUCLEAR_RECEPTOR; 1. 

KW DNA-binding; Nuclear protein; Receptor; Transcription regulation; 

KW Zinc- finger. 

SQ SEQUENCE 560 AA; 61302 MW; 5054F2D2C037CD91 CRC64 ; 



Query Match 42.6%; Score 1342.5; DB 5; Length 560; 

Best Local Similarity 50.1%; Pred. No. 7.3e-92; 

Matches 298; Conservative 71; Mismatches 133; Indels 93; Gaps 



Qy 23 GGVGGLMSYNRGRGGTEVI I KPRSPAWQVATGGSYHGLPAASDAVI VRSP- PGGHLPGP 81 

|| IN : |: : I hlhl II II M I 

Db 37 GGAGGLAA GS PPALS PNLPS WKVEP RLP SPCVGGAASGD 76 

Qy 82 QQQVPPSRNGCSTLFSDIAGVKRLRPDDWLAVNSPPASSPGTSHIS-YTVISNGGGGGGG 140 

Ml Ihl II I I hi :| = II I 

Db 77 GGPVPP KR VRQDD AG AW ISSPSSQMS VG S L S P P P P LLNG VANS S G 121 

Qy 141 GGGGYNTSPMST-NSYDPYSPMSGKIVKEELSPPNSLSGVSSH SDGLKKKKLNHTPS 196 

Ihl ON III llhll = I- : M Ml I 

Db 122 L S P VS NC S S YD T Y S P RG P - - CKE EMS P S S GGGGLNG YF VD S FGD P KKKKG P A P - 172 

Qy 197 TGWNTSASGPGGGVGGNVLNNRPPEELCLVCGDRSSGYHYNALTCEGCKGFFRRSITKN 256 

I lllllllllhllllMIIIIIIIIMIIIMIII 

Db 173 RQQEELCLVCGDRASGYHYNALTCEGCKGFFRRSITKN 210 

Qy 257 AVYQCKYGNNCEIDMYMRRKCQECRLKKCLTVGMRPECVVPEVQCAVKRKEKKAQREKDK 316 

IIIIIIIIMhIIIIMIIIIMIIIIIhlllllllllll Mhlh II I Ih 

Db 211 A VYQC KYGNNCD I DM YMRRKC Q E C R L KKC L S VGMR PE C WP E YQC A I KRE SKKHQ--KDR 268 

Qy 317 PNSTTDISPEIIK IEPTEMKIECGEPMIMGT PMPTVPYVKPLSSEQ 362 

,11 || : : II : I = = hi Mill I 

Db 269 PNSTTRESPSALMAPSSVGGVSPTSQPMGGGGSSLGSSNHEEDKKPWLS PGVKPLSSSQ 328 

Qy 363 KELIHRLVYFQDQYEAPSEKDMKRLTINNQNMDEYDEEKQSDTTYRIITEMTILTVQLIV 4 22 

::||::||hl ::| : ||h||h I I I : : : : IIHIIIIIIII 

Db 329 EDLINKLVYYQQEFESPSEEDMKKTT PFPLGDSEEDNQRRFQHITEITILTVQLIV 3 84 

Qy 423 EFAKRLPGFDKLVREDQITLLKACSSEAMMFRVARKYDITTDSIVFANNQPFSADSYNKA 482 

E I = 1 1 = 1 1 1 1 I IIIIIIIMIIIII II I 11111= llllllllllh: hi I 

Db 38 5 E F S KR VPG FDT LARED Q I T L L KAC S S E VMMLRG ARKYD VKTD S I V F ANNQ P YTRDN YRS A 444 

Qy 483 GLGDAI ENQLS FSRFMYNMKVDNAEYALLTAI VI FSSRPNLLDGWKVEKI QE I YLESLKA 542 

, : : Ml ::||||IIIIIIIIMII 11=1 = 1 llhlll hhh 

Db 445 SVGDSADALFRFCRKMCQLRVDNAEYALLTAIVIFSERPSLVDPHKVERIQEYYIETLRM 504 

Qy 543 YVDNRDRDT AT VR Y ARL L S VLT E LRT LGNENS E LCMT L KLKNR W P P F LAE I WD V 597 

I I :|IMhlllllllll hhl Oh:h Hlllllllh 

Db 505 YSENH - RPPGKNYFARLLSI LTELRTLGNMNAEMCFSLKVQNKKLPPFLAEIWDI 558 



RESULT 4 
044338 

ID 044338 PRELIMINARY; PRT; 5 70 AA. 

AC 044338; 

DT 01-JUN-1998 (TrEMBLrel. 06, Created) 

DT 01-JUN-1998 (TrEMBLrel . 06, Last sequence update) 

DT 01-JUN-2001 (TrEMBLrel. 17, Last annotation update) 

DE AAMECRA2 . 

GN AAMECRA2 . 

OS Amblyomma americanum. 

OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; 

OC Parasitiformes; Ixodida; Ixodidae; Amblyomma. 

OX NCBI_TaxID=6943 ; 

RN [1] 

RP SEQUENCE FROM N . A, 

RA Guo X., Harmon M.A., Laudet V., Mangelsdorf D.J., Palmer M.J.; 

RL Insect Biochem. Mol . Biol. 0:0-0(1997). 

CC -!- SUBCELLULAR LOCATION: NUCLEAR (BY SIMILARITY). 

CC -!- SIMILARITY: TO C4 -TYPE STEROID RECEPTOR ZINC FINGER FAMILY. 

DR EMBL; AF020188; AAB94 567.1; 

DR HSSP; P2 0393; 1A6Y . 

DR InterPro; I PRO 0 0 53 6; Hormone_rec_lig . 

DR InterPro; IPR001723; Strdhormone_receptor . 

DR InterPro; IPR001628; zf-C4. 

DR Pfam; PF00104; hormone_rec; 1. 

DR Pfam; PF00105; zf-C4; 1. 

DR PRINTS; PR00398; STRDHORMONER . 

DR PRINTS; PR00047; STROIDFINGER . 

DR SMART ; SM00430; HOLI ; 1. 



DR SMART; SM00399; ZnF_C4 ; 1. 

DR PROSITE; PS00031; NUCLEAR_RECEPTOR; 1. 

KW DNA- binding; Nuclear protein; Receptor; Transcription regulation; 

KW Zinc- finger. 

SQ SEQUENCE 570 AA; 63127 MW; 1B0FAF391AA2033F CRC64 ; 



Query Match 41.6%; Score 1312.5; DB 5; Length 570; 

Best Local Similarity 51.6%; Pred. No. 1.3e-89; 

Matches 2 83; Conservative 70; Mismatches 12 0; Indels 75; Gaps 13; 

Qy 69 IVRSPPGGHLPGPQQQVPPSRNGCSTLFSDIAGVKRLRPDDWLAVNSPPASSPGTSHIS- 127 

: I || Ml Ihl II I I I I 

Db 77 VLQTDDDGGDGGP VPP KRVRQDDAGAWISSPSSQMSVGSLSP 118 

Qy 12 8 YTVISNGGGGGGGGGGGYNTSPMST-NSYDPYSPMSGKIVKEELSPPNSLSGVSSH S 183 

: II I Ihl Mil III llhll = h: : I 

Db 119 PPPLLNGVANSSG LS PVSNCSSYDTYSPRGP- -CKEEMSPSSGGGGLNGYFVDS 170 

Qy 184 DGL KKKKLNHT P S TGWNT S AS G PGGG VGGNVLNNR P P E E LC L VCGDRS S G YH YNALTC E 24 3 

I III I I lllllllllhlllllllllll 

Db 171 FGDPKKKKGPAP RQQEELCLVCGDRASGYHYNALTCE 207 

Qy 244 GCKGFFRRSITKNAVYQCKYGNNCEIDMYMRRKCQECRLKKCLTVGMRPECWPEVQCAV 3 03 

IIIIIIIIIIIIIIIIIIIMIIhlllllllllllllllllhlllllllllll llh 
Db 208 GCKGFFRRSITKNAVYQCKYGNNCDIDMYMRRKCQECRLKKCLSVGMRPECWPEYQCAI 2 67 

Qy 304 KRKEKKAQREKDKPNSTTDISPEIIK IEPTEMKIECGEPMIMGT PM 349 

Ih II I Ihlllll II : : || : | : : |: 

Db 268 KRESKKHQ- -KDRPNSTTRESPSALMAPSSVGGVSPTSQPMGGGGSSLGSSNHEEDKKPV 325 

Qy 350 PTVPYVKPLSSEQKELIHRLVYFQDQYEAPSEKDMKRLTINNQNMDEYDEEKQSDTTYRI 4 09 

I IIMII |::||::|||:| = = I = 1 I I = ! I I = I | |: = :: 

Db 326 VLSPGVKPLSSSQEDLINKLVYYQQEFESPSEEDMKKTT PFPLGDSEEDNQRRFQH 381 

Qy 410 ITEMTILTVQLIVEFAKRLPGFDKLVREDQITLLKACSSEAMMFRVARKYDITTDSIVFA 469 

llhllllllllllhlhllll | I ■ MM || I Mill: lllllll 

Db 382 ITEITILTVQLIVEFSKRVPGFDTLAREDQITLLKACSSEVMMLRGARKYDVKTDSIVFA 441 

Qy 4 70 NNQPFSADSYNKAGLGDAIENQLSFSRFMYNMKVDNAEYALLTAIVIFSSRPNLLDGWKV 52 9 

I hi I =11= = III -I Ml IM 11 = 1 = 1 II 

Db 442 NNQPYTRDNYRSASVGDSADALFRFCRKMCQLRVDNAEYALLTAIVIFSERPSLVDPHKV 501 
Qy 530 EKIQEIYLESLKAYVDNRDRDTATVRYARLLSVLTELRTLGNENSELCMTLKLKNRWPP 589 

hill hhh I M I MlllhlllllllM MM MM:,: : || 

Db 502 ERI QEYYI ETLRMYSENH - RP PGKNYFARLLS I LTELRTLGNMNAEMCFSLKVQNKKLPP 560 

Qy 590 FLAEIWDV 5 97 

MMM: 
Db 561 FLAEIWDI 568 



RESULT 5 
077240 

ID 07724 0 PRELIMINARY; PRT; 513 AA. 

AC 07724 0; 

DT 01-NOV-1998 (TrEMBLrel. 08, Created) 

DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update) 

DT 01-JUN-2001 (TrEMBLrel. 17, Last annotation update) 

DE ECDYSONE RECEPTOR ISOFORM A. 

GN ECR . 

OS Choristoneura fumiferana (Spruce budworm) . 

OC Eukaryota; Metazoa; Arthropoda; Tracheata; Hexapoda; Insecta; 

OC Pterygota; Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; 

OC Tortricoidea; Tortricidae; Tortricinae; Choristoneura. 

OX NCBI_TaxID=7141; 

RN [1] 

RP SEQUENCE FROM N . A . 

RA Perera S.C., Ladd T.R., Dhadialla T.S., Krell P.J., Sohi S.S., 

RA Retnakaran A., Palli S.R.; 

RT "Studies of two ecdysone receptor isoforms of the spruce budworm, 



RT Choristoneura fumif erana . " ; 

RL Submitted (AUG-1999) to the EMBL/ GenBank/DDB J databases. 

CC SUBCELLULAR LOCATION: NUCLEAR {BY SIMILARITY). 

CC -!- SIMILARITY: TO C4 -TYPE STEROID RECEPTOR ZINC FINGER FAMILY. 

DR EMBL; AF092030; AAC61596.2; -. 

DR HSSP; P20393; 1A6Y. 

DR InterPro; IPR000536; Hormone_rec_lig . 

DR InterPro; I PRO 0172 3; Strdhormone_receptor . 

DR InterPro; IPR001628; zf -C4 . 

DR Pfam; PF00104; hormone_rec ; 1. 

DR Pfam; PF00105; zf-C4; 1. 

DR PRINTS; PR00398; STRDHORMONER . 

DR PRINTS; PR00047; STROIDFINGER . 

DR SMART; SM00430; HOLI ; 1. 

DR SMART; SM00399; ZnF_C4 ; 1. 

DR PROSITE; PS00031; NUCLEAR_RECEPTOR ; 1. 

KW DNA-binding; Nuclear protein; Receptor; Transcription regulation; 

KW Zinc -finger. 

SQ SEQUENCE 513 AA; 57621 MW; F110540A3030E602 CRC64 ; 



Query Match 41.0%; Score 12 93.5; DB 5; Length 513; 

Best Local Similarity 51.4%; Pred. No. 2.9e-88; 

Matches 276; Conservative 76; Mismatches 118; Indels 67; Gaps 13; 

Qy 76 GHLPGPQQQVPPSRNGCSTLFSDIAGVKRLRPDDWLAVNSPPASSPGTSHISYTVISNGG 135 

I III : I II : I I Ihl hll : I I 

Db 11 GVLPGQVKAEPGVHNG QVNGHVR DWMA - GG AGAN S P S PG AVAQ P Q PNNG - 58 

Qy 136 GGGGGGGGGYNTSPMSTNSYDPYSPMSGKIVKEELSPPNSLSGVSSHSDGLKKKKLNHTP 195 

:|| = |: II I I I I =111 HUM =1-1 h = -II 
Db 59 YSSPLSSGSYGPYSP-NGKIGREELSPASSINGCSTDGEARRQKK 102 

Qy 196 STGWNTSASGPGGGVGGNVLNNRPPEELCLVCGDRSSGYHYNALTCEGCKGFFRRSITK 255 

II I lllllllllhllllllllMIIMIIIIIhll 

Db 103 GPA PRQQEELCLVCGDRASGYHYNALTCEGCKGFFRRSVTK 143 

Qy 256 NAVYQCKYGNNCEIDMYMRRKCQECRLKKCLTVGMRPECWPEVQCAVKRKEKKAQREKD 315 

I I I I Ihh Ihlllllllllllllllll lllllllllll llhlllllllhlll 
Db 144 NAVYICKFGHACEMDMYMRRKCQECRLKKCLAVGMRPECWPETQCAMKRKEKKAQKEKD 2 03 

Qy 316 K-PNSTT DISPEIIKIEPTEMKIE CGEPMIMGTPMPTVPYVKPLSSEQ 362 

I I III II I- II : : - =1 h= I 

Db 204 KLPVSTTTVDDHMPPIMQCEPPPPEAARIHEWPRFLSDKLLETNRQKNI P QLTANQ 260 

Qy 363 KELIHRLVYFQDQYEAPSEKDMKRLTINNQNMDEYDEEKQSDTTYRIITEMTILTVQLIV 422 

: II lh::|| II I I = = I = I I = I I I II -III =1 IIIIIIMIIIII 
Db 2 61 QFLIARLIWYQDGYEQPSDEDLKRITQTWQQAD- -DENEESDTPFRQITEMTILTVQLIV 318 

Qy 423 EFAKRLPGFDKLVREDQITLLKACSSEAMMFRVARKYDITTDSIVFANNQPFSADSYNKA 4 82 

I I I I I I I I h llllllllllll II Nihil = 11-11111 - hi II 
Db 319 E F AKG L PG F AK I S Q PDQ I TL LKAC S S E VMMLR VARR YD AAS DS VL F ANNQAYT RDN YRKA 378 

Qy 483 GLGDAIENQLSFSRFMYNMKVDNAEYALLTAIVIFSSRPNLLDGWKVEKIQEIYLESLKA 542 

h Ih I II Ihl HI lllllhllll II I 11 = 11 II = 1 = 

Db 3 79 GMAYVIEDLLHFCRCMYSMALDNIHYALLTAWIFSDRPGLEQPQLVEEIQRYYLNTLRI 438 

Qy 543 YVDNRDRDTA- -TVRYARLLSVLTELRTLGNENSELCMTLKLKNRWPPFLAEIWDV 597 

h h = 1 I I ::|hhllllll 01 H-llllll =1111 I II I I 
Db 439 YILNQLSGSARSSVIYGKILSILSELRTLGMQNSNMCISLKLKNRKLPPFLEEIWDV 495 



RESULT 6 
044336 

ID 044 336 PRELIMINARY; PRT; 444 AA . 

AC 044336; 

DT 01-JUN-1998 (TrEMBLrel . 06, Created) 

DT 01-JUN-1998 (TrEMBLrel. 06, Last sequence update) 

DT 01-JUN-2 001 (TrEMBLrel. 17, Last annotation update) 

DE AAMECRA3 . 

GN AAMECRA3 . 



OS Amblyomma americanum. 

OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acan, 

OC Parasitiformes; Ixodida; Ixodidae; Amblyomma. 

OX NCBI_TaxID-6943 ; 

RN [1] 

RP SEQUENCE FROM N.A. «- — 

RA quo X., Harmon M.A., Laudet V., Mangelsdorf D.J., Palmer M.J.; 

RL Insect Biochem. Mol . Biol. 0:0-0(1997). 

CC -'- SUBCELLULAR LOCATION: NUCLEAR (BY SIMILARITY) . 

CC -1- SIMILARITY: TO C4 -TYPE STEROID RECEPTOR ZINC FINGER FAMILY. 

DR EMBL; AF020186; AAB94565.1; -. 

DR HSSP; P20393; 1A6Y. 

InterPro; IPR000536; Hormone_rec_lig . 



PROS lib; f&uuuji; ^u^uc^.^^^- - 

DNA-binding; Nuclear protein; Receptor; Transcription regulation, 



DR ■ 

DR InterPro; IPR001723; Strdhormone_receptor . 

DR InterPro; IPR001628; zf-C4. 

DR Pfam; PF00104; hormone_rec; 1. 

DR Pfam; PF00105; zf-C4; 1. 

DR PRINTS; PR00398; STRDHORMONER . 

DR PRINTS; PR00047; STROIDFINGER . 

DR SMART; SM00430; HOLI ; 1. 

DR SMART; SM00399; ZnF_C4 ; 1. 

DR PROSITE; PS00031; NUCLEAR_RECEPTOR ; 1 
KW 

KW Zinc-finger. 

SQ SEQUENCE 444 AA; 50283 MW; 3B087EEA29074DB7 CRC64 ; 

Query Match 40.1%; Score 1265.5; DBS; Length 444; 

Best Local Similarity 55.9%; Pred. No. 2.9e-86; 

Matches 255; Conservative 70; Mismatches 84; Indels 47, Gaps 

E E L S P PN S LS G VS S H SDG LKKKKLNHT P STG WNT S AS G PGGG VGGNVLNN 218 

' ' ^IpyT---AAAPEDFWr!cEEM--SPSSG GGG LNG Y F VD S FGD P KKKKG 53 

-RPPEELCLVCGDRSSGYHYNALTCEGCKGFFRRSITKNAWQCKY^ 275 

I ll!lllllll : llllillllllllNMIIIIiNNIiMII^ 



Qy 


168 


Db 


8 


Qy 


219 


Db 


54 


Qy 


276 


Db 


114 


Qy 


330 


Db 


172 


Qy 


382 


Db 


232 


Qy 


442 


Db 


288 


Qy 


502 


Db 


348 


Qy 


562 


Db 


407 



KCQEC RLKKC LT VGMR P E CW P E VQC AVKRKE KKAQRE KD KPNS TTD I S PE 1 1 K 329 

III IIIIIIIIHIM INI IN IM-U: M 1 ii;UU' IL 



WW l-ll-IM 



KDMKRLTINNQN^EYDEBKQSDTTYRIITEm 

.111.1 I |: : := | | | : I I I I I I I I I I I : I l : I I I I I HMH 

EDMKKTT- - - -PFPLGDSEEDNQRRFQHITEITILTVQLIVEFSKRVPGFDTLAREDQIT 

LLKACSSEAMMFRVARKYDITTDSIVFANNQPFSADSYNKAGLGDAIENQLSFSRFM 
LLKACSSEVM^^ 

KVDNAE YALLTAI V I F S S RPNLLDGWKVE KI QE I YLE S LKAYVDNRDRDT ATVRYARLLS 

= I i M 1 1 i II : I ■ I Ill-Ill I J: I ± ' ™ U i 



441 



501 



561 



, i hhl =lh = h : 1 1 1 1 1 



RESULT 7 

076246 ™ 
ID 076246 PRELIMINARY; PRT; 518 AA. 

AC 076246; 

DT 01-NOV-1998 (TrEMBLrel. 08, Created) 

DT 01-JUN-2001 (TrEMBLrel. 17, Last sequence update) 

DT 01-JUN-2001 (TrEMBLrel. 17, Last annotation update) 



DE ECDYSTEROID RECEPTOR. 

/im UPECR 

OS Uca pugilator (Atlantic sand fiddler crab) (Celuca pugilator) . 

OC Eukaryota; Metazoa; Arthropoda; Crustacea; Malacostraca; 

OC Eumalacostraca; Eucarida; Decapoda; Pleocyemata; Br *f^"[ 

OC Eubrachyura; Ocypodoidea; Ocypodidae; Ocypodinae; Uca complex. Celuca. 

OX NCBI_TaxID=6772; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=98368787; PubMed=9705089 ; 

RA Chung A.C., Durica D.S., Clifton S.W., Roe B.A., Hopkins P.M.; 

"Cloning of crustacean ecdysteroid receptor and retinoxd-X receptor 

rt gene nomologs and elevation of retinoid-X receptor mRNA by retinoxc 

RT acid."; 

RL Mol. Cell. Endocrinol. 139:209-227(1998). 

RN [2] 

RP SEQUENCE FROM N.A. 

RA Durica D.S., Wu X., Anilkumar G . , Hopkins P.M., Chung A.C.-K.; 

RL Submitted (APR-2001) to the EMBL/GenBank/DDBJ databases. 

rr -■- SUBCELLULAR LOCATION: NUCLEAR (BY SIMILARITY). 

CC -!- SIMILARITY: TO C4-TYPE STEROID RECEPTOR ZINC FINGER FAMILY. 

DR EMBL; AF034086; AAC33432.2; -. 

DR HSSP; P20393; 1A6Y. 

DR InterPro; I PRO 00 53 6; Hormone_rec_lig - 

DR InterPro; IPR001723; Strdhormone_receptor . 

DR InterPro; IPR001628; zf-C4. 

DR Pfam; PF00104; hormone_rec ; 1. 

DR Pfam; PF00105; zf-C4; 1. 

DR PRINTS; PR00398; STRDHORMONER . 

DR PRINTS; PR00047; STROIDFINGER . 

DR SMART; SM0 043 0; HOLI; 1. 

DR SMART; SM00399; ZnF_C4 ; 1. 

DR PROSITE; PS00031; NUCLEAR_RECEPTOR; 1. _ 

KW DNA-binding; Nuclear protein; Receptor; Transcription regulation; 

KW Zinc -finger. 

FT VARIANT 261 266 AAPISP -> VAPISP. 

SQ SEQUENCE 518 AA; 57408 MW; DD39B1F83753A878 CRC64; 

Query Match 39.9%; Score 1258; DB 5; Length 518; 

Best Local Similarity 52.9%; Pred. No. 1.3e-85; 

Matches 279; Conservative 57; Mismatches 101; Indels 90; Gaps 

Ov 86 PPSRNGCSTLFSDIAGVKRLRPDDW-LAVNSPPASSP-GTSHISYTVISNGGGGGGGGGG 143 

||: I III I I II =111 I I I = I m 1 m 

Db 64 ppT SLASSDIGEVDL DFWDLDLNSP- -SPPHGMASVAST N 101 

Ov 144 GYNTSPMSTNSYDPYSPMSGKIVKEELSPPNSLS - -GVSSHSDGLKKKKLNHTPSTGWN 201 

:| : | | : | | : : : : | | | : I I I I 1= I I I I I I 
Db 102 ALLLN PRAVAS PSDTSSLSG RDDMSPPSSL S NFG ADS YGD - LKKKK 146 

Db 147 gp IPRQQEELCLVCGDRASGYHYNALTCEGCKGFFRRSITKNAVYQC 193 

Qy 262 KYGNNCEIDMYMRRKCQECRLKKCLTVGMRPECWPEVQCAVKRKEKI^QREKDKPNSTT 321 

: 1 1 l 1 1 1 : , i , i , 1 1 : 1 1 : i 1 1 1 1 1 1 1 1 1 1 1 ii iii-iii i = iii : w 

Db 194 KYGimCEMD^RRKCQECRLKKCLOTGMRPECWPESQCQVKREQKKA-RDKDKTYPS- 251 

Qy 322 DISPEIIKIEPTEMKIECGEPMIMGTPMPTVPY VKPLSSEQKELIHRLV 370 

1 | | : || I I I I : I I : I I I : I I 

Db 252 LGSPIAEDKAAPISPVSKDMSAAPRUJVKPLTREQEELINTLV 294 

430 



Qy 371 YFQDQYEAPSEKDMKRLTINNQNMDEYDEEKQSDTTYRIITEOTILTVQLIVEFAKRLPG 

|:| ::: | |:| |:|:: I Ml II = I I I I I I I I I I I M I I I : I ! I I I 

Db 295 YYQEEFEQPTEADVKKIRFN FDGEDTSDMRFRHITEMTILTVQLIVEFSKQLPG 34 8 

0 y 431 FDKLVREDQITLLKACSSEAMMFRVARKYDITTDSIVFANNQPFSADSYNKAGLGDAIEN 490 

I Mlllllllllllll II I II Ml Mllll II l :: II I „„ 

Db 349 FATLQREDQITLL^CSSEVMMLRAARRYDAKTDSIVFGNNYPYTQASYALAGI^ESAEI 408 



Qy 4 91 QLSFSRFMYNMKVDNAEYALLTAIVIFSSRPNLLDGWKVEKIQEIYLESLKAYVDNRDRD 550 

I I : lllllllllll II III llll : Mlhllillhlhlhll 
Db 409 LFRFCRSLCKMKVDNAEYALLAAIAIFSERPNLKELKKVEKLQEIYLEALKSYVENRRLP 468 

Qy 551 T AT VR YARLL S VLT E LRTLGNENS E LCMT LKLKNR W P P F LAE I WD V 597 

: : :|:||::|IMI1III Mhl = 1 llh = 1 I t I 1 I I 1 I I 
Db 469 RSNMVFAKLLNILTELRTLGNINSEMCFSLTLKNKRLPPFLAEIWDV 515 



09346470Results 

SEQ ID NO: 10 

SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


3152 


100 


.0 


599 


22 


AAB67101 


M persicae ecdyson 


2 


2356 


74 


. 7 


450 


20 


AAY28606 


M. persicae EcR po 


3 


1246 


39 


.5 


546 


21 


AAY87470 


European corn bore 


4 


1238 


39 


.3 


757 


20 


AAY28603 


EcR polypeptide su 


5 


1238 


39 


.3 


757 


22 


AAB67096 


L cuprina ecdysone 


6 


1224.5 


38 


.8 


757 


19 


AAW71297 


Lucilia cuprina ec 


7 


1223 


38 


.8 


576 


18 


AAW06533 


Heliothis ecdysone 


8 


1221 


38 


. 7 


878 


12 


AAR13793 


Ecdysone receptor . 


9 


1212 


38 


.5 


878 


14 


AAR32889 


DHR23alpha protein 


10 


1171 


37 


.2 


746 


18 


AAW33655 


Modified ecdysone 


11 


1160 


36 


.8 


746 


21 


AAY70357 


Heterodimeric nucl 


12 


1155 


36 


.6 


764 


21 


AAY70358 


Heterodimeric nucl 


13 


1149 


36 


.5 


746 


18 


AAW33 654 


Modified ecdysone 


14 


1131 


35 


.9 


1041 


18 


AAW33656 


Modified ecdysone 


15 


1129.5 


35 


.8 


606 


20 


AAW95701 


Bombyx mori nuclea 


16 


805 


25 


.5 


319 


18 


AAW06534 


Spodoptera ecdyson 


17 


791 


25 


.1 


550 


19 


AAW45513 


Ecdysone receptor 


18 


791 


25 


.1 


550 


19 


AAW39139 


Drosophila ecdyson 


19 


791 


25 


1 


550 


21 


AAY67558 


Drosophila ecdyson 


20 


791 


25 


1 


550 


21 


AAY78887 


Ecdysone receptor 


21 


777 


24 


7 


195 


20 


AAY28605 


DNA- binding domain 


22 


777 


24 


7 


195 


22 


AAB67100 


M Dersirap prHvpnn 


23 


731 


23 


2 


445 


21 


AAY32374 


Mouse CNREB-1. Mu 


24 


719.5 


22 


8 


440 


14 


AAR33744 


XR2 Homo c; a -n H f>r\ d 


25 


714 


22 


7 


447 


17 


AAW03326 


T iYR - 3 1 nVi nmVian 


26 


689.5 


21 


9 


461 


15 


AAR52980 


Uiima n Y"f» r 1 oml"i "i nanh 

n Li. Ik 10.11 J. C^LJllLU -LllCLllU 


27 


689.5 


21 


9 


461 


17 


AAR97982 


Human steroid rece 


28 


689.5 


21 


9 


461 


17 


AAR98140 


NER receptor poten 


29 


684 


21 


7 


460 


16 


AAR74738 


Human ubiquitous n 


30 


682.5 


21 


7 


461 


17 


AAR96234 


Human foetal lung 


31 


682 


21 


6 


460 


18 


AAW25034 


Human ubiquitous n 


32 


678 


21. 


5 


446 


17 


AAR99736 


Retinoid X recepto 


33 


678 


21 . 


5 


484 


17 


AAR99735 


Retinoid X recepto 


34 


673.5 


21. 


4 


443 


16 


AAR74739 


Rat ubiquitous nuc 


35 


673.5 


21. 


4 


443 


18 


AAW25 035 


Rat ubiquitous nuc 


36 


671 


21 . 


3 


446 


17 


AAR94169 


OR-1 orphan recept 


37 


670 


21. 


3 


469 


17 


AAW03448 


Farnesoid- activate 


38 


663.5 


21. 


1 


451 


17 


AAR99739 


Retinoid X recepto 


39 


661 


21. 


0 


472 


19 


AAW40072 


Human retinoid rec 


40 


648.5 


20. 


6 


477 


22 


AAE06680 


Human nuclear norm 


41 


630.5 


20. 


0 


433 


17 


AAR88452 


Retinoic acid rece 


42 


526.5 


16. 


7 


462 


12 


AAR1054 7 


Murine Retinoic Ac 


43 


526 . 5 


16. 


7 


462 


16 


AAR84724 


Murine retinoic ac 


44 


520.5 


16. 


5 


462 


13 


AAR27534 


RAR-alpha protein. 


45 


520.5 


16. 


5 


462 


16 


AAR68023 


RAR-alpha. Homo s 



ALIGNMENTS 



RESULT 1 
AAB67101 

ID AAB6 7101 standard; Protein; 599 AA. 
XX 

AC AAB67101; 
XX 

DT 10-APR-2001 {first entry) 
XX 

DE M persicae ecdysone receptor EcR subunit SEQ ID NO- 14 
XX 

KW Insect; steroid hormone receptor; juvenile hormone receptor; fly; 



KW EcR subunit; USP subunit ; insecticide. 
XX 

OS Myzus persicae. 
XX 

PN WO200102436-A1. 
XX 

PD ll-JAN-2001. 
XX 

PF 30-JUN-2000; 2000WO-AU00799 . 
XX 

PR 01-JUL-1999; 99US-0346470 . 
XX 

PA (CSIR ) COMMONWEALTH SCI & IND RES ORG. 
XX 

PI Hill RJ, Hannan GN; 
XX 

DR WPI; 2001-080981/09. 

DR N-PSDB; AAF32137. 
XX 

PT New nucleic acid molecule for the regulation of gene expression in 

PT insects - 

XX 

PS Claim 1; Page 141-144; 172pp; English. 
XX 

CC The present invention provides the protein and coding sequences of the 

CC Bemisia tabaci (Silverleaf whitefly) , Myzus persicae (aphid) and Lucilia 

CC cuprina (sheep blowfly) steroid and juvenile hormone steroid receptor 

CC subunits EcR and USP. These can be used to screen for agents with 

CC insecticidal activity. 
XX 

SQ Sequence 599 AA; 



Query Match 100.0%; Score 3152; DB 22; Length 599; 

Best Local Similarity 100.0%; Pred. No. 4.6e-231; 

599; Conservative 0; Mismatches 0; Indels 0; Gaps 

MMDQKCDVGGGGVAAAAAGIGGGGVGGLMSYNRGRGGTEVIIKPRSPAWQVATGGSYHG 60 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
mmdqkcdvggggvaaaaagiggggvgglmsynrgrggteviikpr spawqvatggsyhg 6 0 

L P AAS D AV I VRS P PGGH L PG P QQQ V P P S RNGC S TL F S D I AG VKRLR PDDWLAVNS P PAS S 120 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

lpaasdavivrsppgghlpgpqqqvppsrngcstlf sdiagvkrlrpddwlavnsppass 120 
PGT S H I S YT VI SNGGGGGGGGGGG YNT S PM S TNS YD P YS PMSG K I VKE ELSPPNSLS G VS 180 

1IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIII 

pgt shi sy t vi sngggggggggggynt spms t nsydpy spmsgki vkee 1 sppns 1 sgvs 180 

SHSDGLKKKKLNHTPSTGWNTSASGPGGGVGGNVLNNRPPEELCLVCGDRSSGYHYNAL 240 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIMIIIII 
shsdglkkkklnhtpstgwntsasgpgggvggnvlnnrppeelclvcgdrssgyhynal 24 0 

TCEGCKGFFRRS I TKNAVYQCKYGNNCE IDMYMRRKCQECRLKKCLTVGMRPECWPEVQ 300 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIMIIIIM 

tcegckgf f rrsitknavyqckygnnceidmymrrkcqecrlkkcltvgmrpecwpevq 3 00 

CAVKRKEKKAQREKDKPNSTTDISPEIIKIEPTEMKIECGEPMIMGTPMPTVPYVKPLSS 360 
IIIIIII1IIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 



EQKELIHRLVYFQDQYEAPSEKDMKRLTINNQNMDEYDEEKQSDTTYRIITEMTILTVQL 420 

I II I I I II ! M i: ' I : II l I I M I ! i M M M I I I II I II L I 'I I i I li II 

eqkelihrlvyfqdqyeapsekdmkrltinnqnmdeydeekqsdttyriitemtiltvql 42 0 
IVEFAKRLPGFDKLVREDQITLLKACSSEAMMFRVARKYDITTDSIVFANNQPFSADSYN 480 

1 1 i : 1 1 ; ! ; 1 1 ! 1 1 1 : i i 1 1 1 i : - 

ivef akrlpgf dklvredqitllkacsseammf rvarkydittdsivf annqpf sadsyn 4 80 
KAGLGDAIENQLSFSRFMYNMKVDNAEYALLTAIVIFSSRPNLLDGWKVEKIQEIYLESL 540 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 



Matches 


Qy 


i 


Db 


i 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


301 


Qy 


361 


Db 


361 


Qy 


421 


Db 


421 


Qy 


481 



Db 481 kaglgdaienqlsf srfmynmkvdnaeyalltaivif ssrpnlldgwkvekiqeiylesl 540 



Qy 541 KAYVDNRDRDTATVRYARLLSVLTELRTLGNENSELCMTLKLKNRWPPFLAEIWDVMP 599 

IIIIIIIIIIIIMIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIMIIIIIIIMII 

Db 541 kayvdnrdrdtatvryarllsvltelrtlgnenselcmtlklknrwppf laeiwdvmp 599 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


1160 


36. 


8 


746 


3 


US-09-144-759-18 


Sequence 


18, 


Appl 


2 


1155 


36. 


6 


764 


3 


US-09-144-759-20 


Sequence 


20, 


Appl 


3 


1129.5 


35 . 


8 


606 


4 


US-08-891-298-3 


Sequence 


3, 


Appli 


4 


791 


25 . 


1 


550 


3 


US-08-659-188-18 


Sequence 


18, 


Appl 


5 


791 


25. 


1 


550 


3 


US-08-655-227-18 


Sequence 


18, 


Appl 


6 


791 


25. 


1 


550 


3 


US-08-655-241-18 


Sequence 


18, 


Appl 


7 


725.5 


23 . 


0 


440 


1 


US-08-333-358-8 


Sequence 


8, 


Appli 


8 


725.5 


23 


0 


440 


1 


US-08-463-694-8 


Sequence 


8 , 


Appli 


9 


725.5 


23 


0 


440 


1 


US-08-694-501-8 


Sequence 


8 , 


Appli 


10 


714 


22 


7 


447 


1 


US-08-373-935-1 


Sequence 


1, 


Appli 


11 


689.5 


21 


9 


461 


1 


US-08-330-518-2 


Sequence 


2, 


Appli 


12 


689.5 


21 


9 


461 


1 


US-08-330-283-2 


Sequence 


2, 


Appli 


13 


689.5 


21 


9 


461 


2 


US-08-646-248-2 


Sequence 


2, 


Appli 


14 


689.5 


21 


9 


461 


5 


PCT-US95-13924-2 


Sequence 


2, 


Appli 


15 


689.5 


21 


9 


461 


5 


PCT-US95-13931-2 


Sequence 


2, 


Appli 


16 


682 


21 


6 


460 


1 


US-08-342-411A-2 


Sequence 


2, 


Appli 


17 


678 


21 


5 


446 


2 


US-08-372-652-3 


Sequence 


3, 


Appli 


18 


678 


21 


5 


446 


5 


PCT-US95-16311-3 


Sequence 


3, 


Appli 


19 


678 


21 


5 


484 


2 


US-08-372-652-1 


Sequence 


1, 


Appli 


20 


678 


21 


5 


484 


5 


PCT-US95-16311-1 


Sequence 


1, 


Appli 


21 


673.5 


21 


4 


443 


1 


US-08-342-411A-4 


Sequence 


4, 


Appli 


22 


671 


21 


3 


446 


4 


US-08-776-844-2 


Sequence 


2, 


Appli 


23 


670 


21 


3 


469 


3 


US-08-372-183-2 


Sequence 


2, 


Appli 


24 


670 


21 


3 


469 


4 


US-09-469-721-2 


Sequence 


2, 


Appli 


25 


670 


21 


3 


469 


5 


PCT-US95-17023-2 


Sequence 


2, 


Appli 


26 


663.5 


21 


1 


451 


2 


US-08-372-652-2 


Sequence 


2, 


Appli 


27 


663 .5 


21 


1 


451 


5 


PCT-US95-16311-2 


Sequence 


2, 


Appli 


28 


661 


21 


0 


472 


1 


US-08-496-631-2 


Sequence 


2, 


Appli 


29 


630 . 5 


20 


0 


433 


2 


US-08-466-120-2 


Sequence 


2, 


Appli 


30 


630.5 


20 


0 


433 


5 


PCT-US94-07266-2 


Sequence 


2, 


Appli 


31 


520.5 


16 


5 


462 


2 


US-08-592-383-2 


Sequence 


2, 


Appli 


32 


520 . 5 


16 


5 


462 


2 


US -08-095- 72 8B -4 


Sequence 


4, 


Appli 


33 


520.5 


16 


5 


462 


5 


PCT-US92-02320A-4 


Sequence 


4, 


Appli 


34 


515 


16 


3 


403 


2 


US-08-592-383-4 


Sequence 


4, 


Appli 


35 


514 . 5 


16 


3 


454 


4 


US-08-764-870-5 


Sequence 


5, 


Appli 


36 


514 .5 


16 


.3 


454 


4 


US-08-980-115-5 


Sequence 


5, 


Appli 


37 


514 .5 


16 


.3 


454 


6 


5260432-2 


Patent No 


. 5260432 


38 


509.5 


16 


.2 


462 


6 


5171671-2 


Patent No 


. 5171671 


39 


495.5 


15 


.7 
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Location/Qualifiers 

1. .306250 

/organism=" Sinorhizobium meliloti" 
/strain="1021" 
/ db_xre f = 11 1 axon : 3 8 2 " 
488. .2707 
/gene="SMc00954" 
488. .2707 
/gene= ,, SMc00954" 
/functions "miscellaneous ; 
/note=" Product confidence 
Gene name confidence : hypothetical 
predicted by Codon_usage 
predicted by Homology 
predicted by FrameD" 
/codon_start=l 
/transl_table=ll 
/evidence=not_experimental 

/product=" CONSERVED HYPOTHETICAL PROTEIN" 
/prote in_id= " CAC4 622 5.1" 
/db_xref="GI : 15074580" 

/translat ion="MPAQPHLPANAPNIWIMLDDVGPALPDTFGGVIETPTLSRLAQ 
DGLS YNRFHNAAMCS PSRAALLTGRNHHRVGNGQI AELANDWDGYTGRI PRTSATVAK 
VLG YYG YAT AAFG KWHNT PAN ETTA VG P YTD W P AGEG I G FD Y F YG F LAG ESS QWE P AV 
VENTVRVDPSHGKEGYHFTEDMTNKAVTWMKQVHALTPDRPFFMYWAPGAAHGPHHIF 
KDWADKYKGKFDDGWDAMRERIYARQKQLGWIPQNTQLTPRPDTLSGWEDIPEDEKPF 
QRRLME V F AG YTE HAD VQ AG RL LE ALD E LG I RDNT LI F YWGDNG S S AEGQNGT I S E L 



hypothetical/global homology" 
hypothetical 



LAQNGI RTE I KDHIRAMNELGGMDVLGS P KADNM YHAG WAW AG S T PHRS T KLVAAH FG 

GTRTPLWSWPGKIKPDQTPRSQFHHLNDIVPTIYDVLDIQPPKLVDGISQDPLDGVS 

MTYTFDSPEASGQKKEQYFEIMGSRAIYQDEWIASVFGPRIPWKAGLDPAIFKWSPDN 

DLWELHDLSQDYSQAKNVAADHPEKVEALKRAFGVQAEANKVFPVGGGLWSAVFHPED 

APSNPATEFQFTQDWGVPEFTAPKVGARSNLVTIEAELRPDSEGVLYALGAFSGGVA 

LWVENGKLTYEYNLFEIERTRLESSDPLPSGKVNIEVETRKVSSDHAAPLDWIRIDG 

KEVAKGHVPRSAPLAFTANDAFDVGKDSYSPVALAYFDRKPFAFNGKIESLKVEYLK" 

2887. .4062 

/gene="SMc00955" 

2887. .4062 

/gene* n SMc00955" 

/function^ "miscellaneous; hypothetical /global homology" 

/note=" Product confidence : hypothetical 

Gene name confidence : hypothetical 

predicted by Codon_usage 

predicted by FrameD" 

/ codon_s t ar t = 1 

/transl_table=ll 

/evidence=not_experimental 

/product =" CONSERVED HYPOTHETICAL PROTEIN" 
/protein_id="CAC46226 .1" 
/db_xref ="GI : 15074581" 

/ trans 1 at ion= " MLLYGCAGRPTGVLEPVAANPSAAQVEMLVTTTRSRAEKPGEMF 

SGERALSPTFAEMTVSIPPDTVRKAGEVAWPSKLPPNPETDFATLKAVEVDRGAAEKW 

LNTHVRKSPDGSVLVFIHGFNNHFEDAVFRFAQIIHDSGARSVPVLATWPSRGSLLAY 

G YD RE S TN YT RNAVE RL FQ YLARD PE VKE VS I LAH S MGNW LALE S LRQMA I RNDGL PA 

KFRNVMLAAPDVDVDVFGSQIADMGKQRPRFTLFVSRDDRALAVSRRVWGNVSRLGAI 

DPEQSPYKEELAANGITWDLTKIKAGDNLHHTKFAESPEIVRLIGSRLSSGQTLTDS 

RLGLGDH I VAATAGAAHTVGTAAGLWAAPAAI VDQNTRRNYVHHVEALAE PSGATR 11 

4548. .4724 

/gene^"SMc00959" 

4548. .4724 

/gene="SMc00959 M 

/ f unc t ion= " mi s ce 1 1 aneous ; unknown " 

/note=" Product confidence : hypothetical 

Gene name confidence : hypothetical 

predicted by Codon_usage 

predicted by FrameD" 

/codon_start=l 

/transl_table=ll 

/ evidence =not_exper iment al 

/ product =" HYPOTHETICAL /UNKNOWN PROTEIN" 

/protein_id="CAC46227 . 1" 

/db_xref ="GI : 15074582" 

/ 1 r ans 1 at ion= " MSKRELI DTGTDKRYVRRDESGKFKHS VDVGRSLSSDKKQKAKH 
DAKPGQGDKGDRKN " 
4728. .5522 

/gene="xthAl OR SMc00956" 
4728. .5522 

/gene="xthAl OR SMc00956" 
/EC_number="3 .1.11.2" 

/function="macromolecule metabolism; macromolecule 

degradation; degradation of dna" 

/note=" Product confidence : probable 

Gene name confidence : probable 

predicted by Codon_usage 

predicted by Homology 

predicted by FrameD" 

/codon_start=l 

/transl_table=ll 

/evidence=not_experimental 

/product =" PROBABLE EXODEOXYRIBONUCLEASE III PROTEIN" 
/protein_id="CAC4622 8 . 1" 
/db_xref="GI :15074583" 

/ translations" MKIATYNVNGVNGRLGVLLRWLEEASPDWCLQELKAPDPKFPV 

KAIEAAGYGAIWHGQKSWNGVAILARDREPTLTRKGLPGDPDDTHSRYIEAAVEGMVI 

GCLYLPNGNPYPGPKFEYKLAWFHRLTAYAAELLELDVPVILAGDYNVMPTELDVYKP 

E RWVND AL FR I E VRD A YHRL LEQG WTD ALRQ LH PG E R VYT FWD Y FRNAF ARD AGLR I D 

HLLLS PHVTLRLSAAGVDRHVRGWEHTSDHAPAWIELSDGPAEEDQ " 

5519. .5737 

/gene="SMc00958" 



CDS 5519. .5737 

/gene= ,, SMc00958" 

/function= "miscellaneous; unknown" 

/note=" Product confidence : hypothetical 

Gene name confidence : hypothetical 

predicted by Codon_usage 

predicted by FrameD" 

/ codon_s t ar t = 1 

/transl_table=ll 

/ evidence =not_experimental 

/product = "HYPOTHETICAL/ UNKNOWN PROTEIN" 

/protein_id= » CAC4 6229.1" 

/db_xref="GI :15074584" 

/translations "MTMNDWRAELRLELEKLVDAAWAGARQRDVYAAILDEIGQLRL 
AYERDPDPADDISEQVIEEPSNNWPAAD" 
repeat_region 5839. .6005 

/note="Sm-4 OR SMc04555 
REPEAT SM-4 

predicted by Homology" 

/ evidence=not_experimental 
gene complement (6137 . .9652) 

/gene="mfd OR SMc04450" 
CDS complement (6137 . .9652) 

/gene="mfd OR SMc04450" 

/function="macromolecule metabolism; macromolecule 
synthesis, modification; dna - replication, repair, 
restr . /modif . " 

/notes "Product confidence : probable 
Gene name confidence : probable 
predicted by Codon_usage 
predicted by Homology 
predicted by FrameD" 
/ codon_start=l 
/transl_table=ll 
/evidence=not_experimental 

/product =" PROBABLE TRANSCRIPTION-REPAIR COUPLING FACTOR 
(TRCF) PROTEIN" 
/protein_id="CAC46230.1" 
/db_xref ="GI : 15074585" 

/translation^ "MTMIPGFDPRKILAATREITIGPVPTGAEALVLAELARAGAPVA 
YI LSDGQKVADLEQVLGFVAPDI PVLTLPGWDCLPYDRVS PSADTSARRLAALSALI A 
HRRK PH PA I VL VT I NAALQRI S P QD V I E S LAFT AR PGNQ I RMDD LAARLE RNG F E RV P 
TVREMGEFAVRGGILDVYVPGSGEPLRLDFFGDTLEAIRSFDPASQRTIGQVRSLDLN 
PMSEVSLTPETISHFRKQYLSLFGAATRDDALYQAVSEGRRYAGMEHWLPLFYDRLET 
VFDYLDGFRIVTDHLAREAAAERSKLVLDYYDARLASASPGKSQVTQGTPYKPVPPDM 
LYLTAKGFGEALNDLNAVRLS PFTEHEGEARQWNI EARQGLRWAKPAGEADNDGTRT 
NVFDQAVKHIAEKRAKGAKVIVSGWTEGSLDRLLQVLAEHGLANIRPVKALSDIGSLK 
PGEAASAVLSLEAGFETGDLWIGEQDILGDRLVRRSKRRKRGADFIAEVTGLDEGSY 
WHAEHGIGRFVGLRTIEAAGAPHDCLELVYADDAKLFLPVENIELLSRYGSEGTDAV 
LDKLGGVAWQARKAKLKKRLLDMAGGLIRIAAERHTRHAPVLAAQDGVYDEFAARFPY 
DETEDQLNSIDAVRDDLGRGRPMDRLVCGDVGFGKTEVALRAAFIAAMNGVQVAVWP 
TTLLARQHFKTFSDRFRGLPIRIQQASRLVGSKDLALTKKEVAEGKTDIWGTHALLG 
SS IKFANLGLLI IDEEQHFGVKHKERLKELKTDVHVLTLSATPI PRTLQLALTGVREL 
SLITTPPVDRMAVRTFISPFDALVIRETLMREHYRGGQSFYVCPRVSDLPEIHDFLKS 
DVPELKVAVAHGQMPATELEDIMNAFYEGRYDVLLSTTIVESGLDVPTANTLIVHRAD 
MFGLAQLYQLRGRVGRSKVRAFALFTLPVNKTLTGPAERRLKVLQSLDTLGAGFQLAS 
HDLDIRGAGNLLGEEQSGHIKEVGFELYQQMLEEAVAELKGEEEIHDTGWSPQISVGT 
PVMIPEEYVPDLNLRLGLYRRLGELTDLKEIDGFGAELIDRFGPLPTEVQHLLKIVYV 
KSLCRTANVEKLDAGPKGVWQFRNKEFPNPAALVGYIAKQGTVAKIRPDQSIFFQRE 
LATPEKRLSGAAMVMTQLAALAKAG " 

gene complement (9655 . .9960) 

/gene="SMc00227" 

CDS complement (9655 . .9960) 

/gene= ,, SMc00227 ,, 

Query Match 1.8%; Score 33; DB 1; Length 3062 50; 

Best Local Similarity 100.0%; Pred. No. 3e-06; 

Matches 33; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



399 cggcggcggcggtggcggcggtggcggcggtgg 431 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 



Db 277983 CGGCGGCGGCGGTGGCGGCGGTGGCGGCGGTGG 278015 



RESULT 2 

CHPRGITX 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

JOURNAL 
REFERENCE 

AUTHORS 

TITLE 
JOURNAL 
MEDLINE 
COMMENT 

FEATURES 

source 



CHPRGITX 2512 bp DNA PRI 

Chimpanzee 28S ribosomal RNA gene fragment. 
M30950 

M30950.1 GI:176891 
2 8S ribosomal RNA. 
Chimpanzee DNA. 
Pan troglodytes 
Eukaryota; Metazoa; Chordata 
Mammalia; Eutheria; Primates 

1 (bases 1 to 2512) 
Gonzalez , I . L . , Sylvester , J . E 
Schmickel,R.D. 
Unpublished 

2 (bases 193 to 261; 375 to 945 
2190 to 2204) 

Sylvester , J . E 



rRNA 



BASE COUNT 
ORIGIN 



27-APR-1993 



Craniata; Vertebrata; Euteleostomi ; 
Catarrhini; Hominidae; Pan. 

Smith, T.F., Stambolian, D . and 



Smith, T.F., Stambolian, D . and 



1042 to 1079; 1334 to 1357; 

to 1958, 
Gonzalez, I .L. , 
Schmickel,R.D. 

Ribosomal RNA gene sequences and hominoid phylogeny 

Mol. Biol. Evol. 7, 203-219 (1990) 

90294723 

Draft entry and computer -readable sequence for [2] , [1] kindly 
submitted by I.L.Gonzalez, 20 -DEC- 1989. 

Locat ion/Qual i f ier s 

1. .2512 

/organism="Pan troglodytes" 

/db_xref= ,, taxon:9598 M 

<1. .>2512 

/product="28S ribosomal RNA" 
397 a 837 c 924 g 354 t 
Chromosomes 14, 15, 17, 22, 23. 



1742 



Query Match 1.7%; Score 30; DB 9; Length 2512; 

Best Local Similarity 100.0%; Pred. No. 9.1e-05; 

Matches 30; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 399 cggcggcggcggtggcggcggtggcggcgg 42 8 

Illlllllllllllllllllllllllllll 
Db 4 96 CGGCGGCGGCGGTGGCGGCGGTGGCGGCGG 525 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


1797 


100 


0 


1797 


22 


AAF32137 


M persicae ecdyson 


2 


1350 


75 


1 


1353 


20 


AAX90669 


M. persicae ecdyso 


3 


424 


23 


6 


585 


20 


AAX90668 


M. persicae ecdyso 


4 


424 


23 


6 


585 


22 


AAF32134 


M persicae ecdyson 


5 


28 


1 


6 


379 


21 


AAC56804 


Eucalyptus grandis 


6 


28 


1 


6 


2850 


20 


AAV64278 


Human EGR-3 cDNA. 


7 


23 


1 


3 


23 


20 


AAX90678 


Apl primer . Synth 


8 


23 


1 


3 


23 


22 


AAF32146 


M persicae ecdyson 


9 


23 


1 


3 


208 


20 


AAX90672 


EcR probe 1 . Synt 


10 


23 


1 


3 


208 


22 


AAF32135 


M persicae ecdyson 


11 


23 


1 


3 


2822 


22 


AAH24912 


Nucleotide "sequenc 


12 


22 


1 


2 


1677 


17 


AAT31932 


Retinoid X recepto 


13 


22 


1 


2 


1787 


17 


AAT31928 


Retinoid X recepto 


14 


22 


1 


2 


48300 


22 


AAF61281 


N. magadaii bacter 


15 


21 


1 


2 


76 


21 


AAC17910 


Human secreted pro 


16 


21 


1 


2 


126 


15 


AAQ73442 


Crohn's disease/ul 


17 


21 


1 


2 


300 


21 


AAA35327 


Myrtaceae microsat 


18 


21 


1 


2 


800 


22 


AAH03632 


Human cDNA clone ( 


19 


21 


1 


2 


1028 


13 


AAQ27091 


XTY26 probe. Homo 


20 


21 


1 


2 


1028 


22 


AAA91542 


1 . Okb PstI f ragmen 





21 


21 


1 


2 


1028 


22 


AAS01347 


Human Fragile X Sy 


c 


22 


21 


1 


2 


1414 


22 


AAH17027 


Human cDNA seguenc 




23 


21 


1 


2 


1601 


22 


AAH13799 


Human cDNA sequenc 




24 


21 


1 


2 


1608 


21 


AAZ22298 


Human potassium ch 


c 


25 


21 


1 


2 


2036 


19 


AAV2 954 0 


Homo sapiens D8725 


c 


26 


21 


1 


2 


2075 


18 


AAT75444 


cDNA encoding oste 


c 


27 


21 


1 


2 


2205 


19 


AAV04680 


Human presenilin i 


c 


28 


21 


1 


2 


2430 


16 


AAQ83161 


DNA encoding malic 


c 


29 


21 


1 


2 


3517 


17 


AAT31287 


Rabbit poly-immuno 




30 


21 


1 


2 


5820 


18 


AAT71205 


HaSNPV polyhedrin 




31 


21 


1 


2 


6530 


14 


AAQ51557 


Loricrin gene . Ho 




32 


21 


1 


2 


6530 


20 


AAZ22072 


Nucleotide sequenc 




33 


21 


1 


2 


6530 


22 


AAC68948 


Mouse loricrin gen 




34 


21 


1 


2 


7860 


17 


AAT44380 


Stretch-activated 




35 


21 


1 


2 


8601 


13 


AAQ24828 


AFP-1. Homo sapie 




36 


21 


1 


2 


8601 


13 


AAQ24 82 9 


AFP-1 (C 7508 T) . 


c 


37 


21 


1 


2 


9997 


18 


AAT97925 


Composite sequence 


c 


38 


21 


1 


2 


9997 


19 


AAV05839 


Mouse huntingtin c 




39 


20 


1 


1 


87 


21 


AAA05701 


Oligonucleotide SE 


c 


40 


20 


1 


1 


111 


14 


AAQ39031 


Oligonucleotide id 


c 


41 


20 


1 


1 


115 


14 


AAQ39033 


Oligonucleotide id 




42 


20 


1 


1 


452 


22 


AAF68087 


Human lung tumour 




43 


20 


1 


1 


1289 


22 


AAF75749 


Rice MADS box gene 


c 


44 


20 


1 


1 


1371 


18 


AAT78974 


Mouse Huntington's 




45 


20 


1 


1 


1491 


20 


AAX15147 


Nucleotide sequenc 



ALIGNMENTS 



RESULT 1 
AAF32137 

ID AAF32137 standard; cDNA; 1797 BP. 
XX 

AC AAF32137; 
XX 

DT 10-APR-2001 {first entry) 
XX 

DE M persicae ecdysone receptor EcR subunit coding sequence SEQ ID NO: 13. 
XX 

KW Insect; steroid hormone receptor; juvenile hormone receptor; fly; 

KW EcR subunit; USP subunit; insecticide; ss. 

XX 

OS Myzus persicae. 
XX 

PN WO200102436-A1 . 
XX 

PD ll-JAN-2001. 
XX 

PF 30-JUN-2000; 2000WO-AU00799 . 
XX 

PR 01-JUL-1999; 99US- 0346470 . 
XX 

PA (CSIR ) COMMONWEALTH SCI & IND RES ORG. 
XX 

PI Hill RJ, Hannan GN; 
XX 

DR WPI; 2001-080981/09. 

DR P-PSDB; AAB6 7101. 
XX 

PT New nucleic acid molecule for the regulation of gene expression in 

PT insects - 

XX 

PS Claim 23; Page 13 8-141; 172pp; English. 
XX 

CC The present invention provides the protein and coding sequences of the 

CC Bemisia tabaci (Silverleaf whitefly) , Myzus persicae (aphid) and Lucilia 

CC cuprina (sheep blowfly) steroid and juvenile hormone steroid receptor 

CC subunits EcR and USP. These can be used to screen for agents with 

CC insecticidal activity. 
XX 



SQ Sequence 1797 BP; 512 A; 444 C; 459 G; 382 T; 0 other; 



Query Match 100.0%; Score 1797; DB 22; Length 1797; 

Best Local Similarity 100.0%; Pred. No. 0; 

1797; Conservative 0; Mismatches 0; Indels 0; Gaps 

atgatggaccagaaatgtgacgtcggcggtggtggtgtcgctgctgccgccgccggtatc 60 

IIIIIIIIIIIIIIIIMIMIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIII 

atgatggaccagaaatgtgacgtcggcggtggtggtgtcgctgctgccgccgccggtatc 60 
ggtggcggcggtgtcggcggcctcatgtcgtacaaccgtggccgtggcggcaccgaggtc 12 C 

iiiiiiiiiiiMiMiiiiiiiiiiiiiiiiiiimiiiiiiiiiiiiiiiiniiii 



atcatcaaaccccgtagtcctgccgtggtgcaggtggccaccggtggcagttaccacggc 180 

1 1 1 . ! . ! j I I ! I I ' I I I i : I I I : 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 I I I I I I : ! ! I 
atcatcaaaccccgtagtcctgccgtggtgcaggtggccaccggtggcagttaccacggc 180 

ctgccggcggcctccgacgccgtcatcgtgcgcagcccgccaggcggccacttgcccggg 24 0 

I ! I : I 1 1 : I I I I : , I 1 1 . 1 1 1 I 1 1 1 1 1 1 1 . I i I : : I I I I M 

ctgccggcggcctccgacgccgtcatcgtgcgcagcccgccaggcggccacttgcccggg 24 0 
ccgcagcagcaagtgccgccgtcccgcaacggctgttccaccctgtttagcgacatcgct 3 0 0 

II I II MMMMII IMMIIIMM II i III MIMIIII III 

ccgcagcagcaagtgccgccgtcccgcaacggctgttccaccctgtttagcgacatcgct 3 00 
ggcgtcaagcgactcaggcccgacgattggttggccgtcaactcgccgcccgcctcttcg 360 

iiiiiiiiiiiiiiiiiiiiMiiiiiiiiiiimiiiiiiiiiiiiiiiiiiiiiiii 

ggcgtcaagcgactcaggcccgacgattggttggccgtcaactcgccgcccgcctcttcg 360 



IIIIIIMIIIIIIIIIIIIIMIIIIMMMIIIIMIIIIIIIIIIMIIMIII 



ggcggcggtggttacaacacgtctccaatgtcgaccaacagctacgacccgtacagtccg 4 80 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIMIIIIIIIIIIIIIIIII 

ggcggcggtggttacaacacgtctccaatgtcgaccaacagctacgacccgtacagtccg 4 80 
atgagtggaaaaatcgtcaaagaagagttgtctccgccaaacagcctgtcgggagtcagc 54 0 

1 1 1 . i 1 1 1 1 1 1 1 < 1 1 : i ii - 1 1 1 ' 1 1 1 .ill 

atgagtggaaaaatcgtcaaagaagagttgtctccgccaaacagcctgtcgggagtcagc 54 0 



IIIIIIIIIIIMIIIIMIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIII 



aacacctcggcatcgggccccgggggtggcgttggtggcaatgtgctgaacaaccgacct 66 0 

MM II IMIhllMI IMIIIIIIIIII inillMIIIIIIII III 

aacacctcggcatcgggccccgggggtggcgttggtggcaatgtgctgaacaaccgacct 660 

cccgaagagctgtgcctggtgtgtggcgaccggtcgtccggttaccattacaacgctctc 720 

IIII'IIIIIIIIn lllllll: IN illlllll MIM III, IIM'II 

cccgaagagctgtgcctggtgtgtggcgaccggtcgtccggttaccattacaacgctctc 72 0 

acatgcgaaggatgcaaggggttcttccggaggagcatcaccaagaacgccgtgtaccag 78 0 

MIMIIIM.IM MUM IMIMMMMI. MIIMMMIMIMIMI III 

acatgcgaaggatgcaaggggttcttccggaggagcatcaccaagaacgccgtgtaccag 780 

tgcaagtacggcaacaattgcgaaatcgacatgtacatgaggcggaagtgccaggagtgc 84 0 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIMIIIIIIIIIIIIMIIIII 

tgcaagtacggcaacaattgcgaaatcgacatgtacatgaggcggaagtgccaggagtgc 84 0 

cggctgaaaaaatgcctgaccgtcggcatgaggcctgaatgtgttgtacctgaagttcaa 900 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

cggctgaaaaaatgcctgaccgtcggcatgaggcctgaatgtgttgtacctgaagttcaa 900 

tgcgcagtaaaaagaaaggagaaaaaagctcaacgagaaaaagataaaccaaattctact 960 

MM IIIIMIIIIMIII IMIIIMM. MIIIMI MIIIIMI Mill MM 

tgcgcagtaaaaagaaaggagaaaaaagctcaacgagaaaaagataaaccaaattctact 960 



Matches 


Qy 


1 


Db 


1 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


301 


Qy 


361 


Db 


361 


Qy 


421 


Db 


421 


Qy 


481 


Db 


481 


Qy 


541 


Db 


541 


Qy 


601 


Db 


601 


Qy 


661 


Db 


661 


Qy 


721 


Db 


721 


Qy 


781 


Db 


781 


Qy 


841 


Db 


841 


Qy 


901 


Db 


901 



Qy 


961 


Db 


961 


Qy 


1021 


Db 


1021 


Qy 


1081 


Db 


1081 


Qy 


1141 


Db 


1141 


Qy 


1201 


Db 


1201 


Qy 


1261 


Db 


1261 


Qy 


1321 


Db 


1321 


Qy 


1381 


Db 


1381 


Qy 


1441 


Db 


1441 


Qy 


1501 


Db 


1501 


Qy 


1561 


Db 


1561 


Qy 


1621 


Db 


1621 


Qy 


1681 


Db 


1681 


Qy 


1741 


Db 


1741 



acagacatttctcctgaaataataaaaatagaacctacagagatgaagattgaatgtggt 1020 

I I I I I I I I I I I I I I 'I I I 1 I I I I I I I I ; j I I I ] 1 ! I I I I I I : I I ^ > I I I I : - I I I I I I I I 

acagacatttctcctgaaataataaaaatagaacctacagagatgaagattgaatgtggt 1020 

gaaccaatgataatgggcacacctatgccgactgtaccttacgtgaaacctttgagttct 1080 

IMIIIIIIIIIIMIIII IIIIII1MIIIMIIIIIIIIIM IMIMMMM 

gaaccaatgataatgggcacacctatgccgactgtaccttacgtgaaacctttgagttct 1080 

gaacaaaaagaactgatccaccgacttgtctatttccaggatcaatatgaagctcctagt 1140 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIII 

gaacaaaaagaactgatccaccgacttgtctatttccaggatcaatatgaagctcctagt 1140 

gaaaaagacatgaaacgtttaacaataaataatcaaaatatggatgaatatgatgaagaa 1200 

IMIMMMM MIMMMMMIMM! II MM III MUM 

gaaaaagacatgaaacgtttaacaataaataatcaaaatatggatgaatatgatgaagaa 1200 

aaacaaagtgacaccacatatcgaatcatcactgagatgacaatactcacagttcaactg 1260 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiinii 

aaacaaagtgacaccacatatcgaatcatcactgagatgacaatactcacagttcaactg 12 60 

attgttgagtttgccaaacgattaccaggtttcgataaacttgtaagagaagatcaaatc 1320 

IIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

attgttgagtttgccaaacgattaccaggtttcgataaacttgtaagagaagatcaaatc 1320 

actttactcaaggcttgctcaagtgaagctatgatgttcagggtagcaaggaagtatgac 1380 

III III MMMMMMM MM IIIMIimillMIIMIIM II 

actttactcaaggcttgctcaagtgaagctatgatgttcagggtagcaaggaagtatgac 1380 

atcaccactgactcaatagtgtttgctaacaaccagccattttcagctgattcatataac 144 0 

iihiiii MMiiiiiiii; in i iii'Mi in mi mil: ii 

atcaccactgactcaatagtgtttgctaacaaccagccattttcagctgattcatataac 1440 

aaagctggattgggagatgccattgaaaaccaactgtcattcagtcggtttatgtacaat 1500 

MIMMMMMM Ml IMMMIMMIMMIMMIMMIMMI MM 

aaagctggattgggagatgccattgaaaaccaactgtcattcagtcggtttatgtacaat 1500 

atgaaggtggataacgcagaatatgccttattgaccgccatcgtcatattttcgagtagg 1560 

MM MIMIIMI MMIMMMMIMMMMMIIIMIMII Ml 

atgaaggtggataacgcagaatatgccttattgaccgccatcgtcatattttcgagtagg 1560 

ccaaatttactagatggttggaaagtggagaaaatccaagaaatctacctagagtcctta 1620 

II Mi MIMIMI MIIMIII MIMMM MIMMMMII; II 

ccaaatttactagatggttggaaagtggagaaaatccaagaaatctacctagagtcctta 1620 

aaagcttatgtagataatcgagaccgtgacacagcaactgtacgatatgcgcgacttctc 16 80 

MM; II MIMMIIIIMI Mil Ml Ml IMMIIIMIIM II 

aaagcttatgtagataatcgagaccgtgacacagcaactgtacgatatgcgcgacttctc 1680 

tcagtacttacagaattgcgcacattaggcaatgaaaactctgagctatgtatgacactg 1740 

MIIIIIIIIMMMIIM MM lllllll MIMI II Mill lllllll 

tcagtacttacagaattgcgcacattaggcaatgaaaactctgagctatgtatgacactg 1740 



llllllllilllllllMIIIIIIMIIIIIIIIIIIIIIIIIIIIM 



RESULT 2 
AAX90669 

ID AAX90669 standard; CDNA; 13 53 BP. 
XX 

AC AAX90669; 
XX 

DT ll-OCT-1999 (first entry) 
XX 

DE M. persicae ecdysone receptor EcR polypeptide subunit full-length cDNA. 
XX 



KW Myzus persicae EcR polypeptide subunit full-length cDNA; aphid; 

KW hemipteran insect; thermostable ecdysone receptor; reporter gene; 

KW EcR partner protein; USP polypeptide; insect steroid; juvenile hormone; 

KW juvenile hormone receptor; ecdysteroid receptor; SRE; 

KW steroid response element; beta-galactosidase gene; 

KW chloramphenicol acetyltransf erase gene; CAT gene; iridoid glycoside; 

KW insecticidally-active agent; bisacylhydrazine insecticide; ds . 

XX 

OS Myzus persicae. 
XX 

FH Key 

FT CDS 
FT 
FT 
FT 
XX 

PN WO9936520-A1 
XX 

PD 22-JUL-1999. 
XX 

PF 15-JAN-1999; 
XX 

PR 15-JAN-1998; 
XX 

PA (CSIR ) COMMONWEALTH SCI & IND RES ORG. 
XX 

PI Hannan GN, Hill RJ; 
XX 

DR WPI; 1999-444393/37. 

DR P-PSDB; AAY28606. 
XX 

PT Steroidal and juvenile hormone receptors and partner proteins, 

PT useful for identification of modulators and insect icidal compounds 

XX 

PS Claim 16; Pages 108-111; 125pp; English. 
XX 

CC The present sequence is a full-length cDNA encoding an EcR polypeptide 

CC subunit of ecdysone receptor from a hemipteran aphid, Myzus 

CC persicae. The EcR polypeptide and an EcR partner protein 

CC (USP polypeptide) subunit s form the functional heterodimeric ecdysone 

CC receptor which comprises DNA- and ligand- binding domains and 

CC is thermostable. It binds an insect steroid, juvenile hormone 

CC or an insecticidally-active agent to form a complex that modulates the 

CC expression of a gene operably under the control of a steroid response 

CC element (SRE) or a promoter comprising SRE. The polynucleotide is used 

CC to produce recombinant insect ecdysone and juvenile hormone receptors 

CC which are used to regulate expression of reporter genes like beta- 

CC galactosidase, beta-glucuronidase and chloramphenicol acetyltransf erase 

CC (CAT) genes. The receptors are also useful for screening and identifying 

CC insecticidally-active agents e.g. bisacylhydrazine insecticide, iridoid 

CC glycoside and other non-steroidal modulators of the ecdysteroid and 

CC juvenile hormone receptors. 

XX 

SQ Sequence 1353 BP; 448 A; 289 C; 310 G; 306 T; 0 other; 



Location/Qualifiers 
1. .1353 
/*tag= a 

/product= "EcR polypeptide subunit of M. persicae 
ecdysone receptor" 



99WO-AU00033 . 
98AU-0001356. 



Query Match 75.1%; Score 1350; DB 20; Length 1353; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 1350; Conservative 0; Mismatches 0; Indels 0; Gaps 
Qy 44 8 atgtcgaccaacagctacgacccgtacagtccgatgagtggaaaaatcgtcaaagaagag 507 

I 1 1 1 I I 1 1 ! 1 1 I : I 1 1 I ! I 1 1 I 1 1 1 : i 1 1 I i 1 1 1 I 1 1 I . I I I I [ I ill. 

Db 1 atgtcgaccaacagctacgacccgtacagtccgatgagtggaaaaatcgtcaaagaagag 60 

Qy 508 ttgtctccgccaaacagcctgtcgggagtcagcagccattcggatgggttgaagaagaag 567 

1 1 : - ' 1 1 1 1 : f 1 1 ! 1 1 1 1 : 1 1 1 1 : II II I ! 1 1 1 1 . 1 1 1 1 1 ! i 1 1 

Db 61 ttgtctccgccaaacagcctgtcgggagtcagcagccattcggatgggttgaagaagaag 12 0 
Qy 56 8 aaactcaaccacacgccctcgaccggtgtcgtcaacacctcggcatcgggccccgggggt 627 

IIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIMIII 

Db 121 aaactcaaccacacgccctcgaccggtgtcgtcaacacctcggcatcgggccccgggggt 180 



Qy 


628 


Db 


181 


Qy 


688 


Db 


241 


Qy 


748 


Db 


301 


Qy 


808 


Db 


361 


Qy 


868 


Db 


421 


Qy 


928 


Db 


481 


Qy 


988 


Db 


541 


Qy 


1048 


Db 


601 


Qy 


1108 


Db 


661 


Qy 


1168 


Db 


721 


Qy 


1228 


Db 


781 


Qy 


1288 


Db 


841 


Qy 


1348 


Db 


901 


Qy 


1408 


Db 


961 


Qy 


1468 


Db 


1021 


Qy 


1528 


Db 


1081 


Qy 


1588 


Db 


1141 


Qy 


1648 



ggcgttggtggcaatgtgctgaacaaccgacctcccgaagagctgtgcctggtgtgtggc 6 87 

III'MI I IN III 1 1 li: I III II KMI Mill MINIMI II II 1 1 1 II II 

ggcgttggtggcaatgtgctgaacaaccgacctcccgaagagctgtgcctggtgtgtggc 240 
gaccggtcgtccggttaccattacaacgctctcacatgcgaaggatgcaaggggttcttc 747 

IIMM II Ml 1 1 Ml Mil MIM MM MINIMI Ml I II II II 

gaccggtcgtccggttaccattacaacgctctcacatgcgaaggatgcaaggggttcttc 300 
cggaggagcatcaccaagaacgccgtgtaccagtgcaagtacggcaacaattgcgaaatc 807 

I : I M I M M 1 1. M 1 1 1 1 1 1 1 I M M I M M 1 : 1 1 ' I M 1 1 1 M M M 1 M : : I 

cggaggagcatcaccaagaacgccgtgtaccagtgcaagtacggcaacaattgcgaaatc 360 
gacatgtacatgaggcggaagtgccaggagtgccggctgaaaaaatgcctgaccgtcggc 867 

1 1 1 1 NININ I II Ml N III II II II Nl II MINN Mill II N IN II II 

gacatgtacatgaggcggaagtgccaggagtgccggctgaaaaaatgcctgaccgtcggc 420 
atgaggcctgaatgtgttgtacctgaagttcaatgcgcagtaaaaagaaaggagaaaaaa 927 

Nil MINI MINIMI IM INN IIIINIIIIIIMI MM! 

atgaggcctgaatgtgttgtacctgaagttcaatgcgcagtaaaaagaaaggagaaaaaa 4 80 
gctcaacgagaaaaagataaaccaaattctactacagacatttctcctgaaataataaaa 987 

MINI IMI MINI III MM II INIIMII I NUN 

gctcaacgagaaaaagataaaccaaattctactacagacatttctcctgaaataataaaa 540 
atagaacctacagagatgaagattgaatgtggtgaaccaatgataatgggcacacctatg 104 7 

MM! Mill I; . M 

atagaacctacagagatgaagattgaatgtggtgaaccaatgataatgggcacacctatg 600 
ccgactgtaccttacgtgaaacctttgagttctgaacaaaaagaactgatccaccgactt 1107 

i f 1 1 1 1 1 1 1 1 1 1 j 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 j 1 1 

ccgactgtaccttacgtgaaacctttgagttctgaacaaaaagaactgatccaccgactt 660 
gtctatttccaggatcaatatgaagctcctagtgaaaaagacatgaaacgtttaacaata 116 7 

1 1 1 1 1 1 1 I j 1 1 : 1 1 : ! I ^ 1 1 1 1 i 1 1 1 1 1 1 1 . 1 1 1 1 1 ; , 1 1 j 1 1 1 1 1 1 ] 

gtctatttccaggatcaatatgaagctcctagtgaaaaagacatgaaacgtttaacaata 720 
aataatcaaaatatggatgaatatgatgaagaaaaacaaagtgacaccacatatcgaatc 1227 

IMIIIIIIIIIIIIIIIIIIIIIIIIIMIIIMIIIIIIIIIIIIIIIIIIIIIIMI 

aataatcaaaatatggatgaatatgatgaagaaaaacaaagtgacaccacatatcgaatc 780 
atcactgagatgacaatactcacagttcaactgattgttgagtttgccaaacgattacca 12 87 

1 1 1 1 1 1 1 ; 1 1 1 1 ; 1 1 M : 1 1 1 i 1 1 1 1 1 ! I , I i 1 1 1 . II M : M 1 1 M I 1 1 1 1 

atcactgagatgacaatactcacagttcaactgattgttgagtttgccaaacgattacca 84 0 
ggtttcgataaacttgtaagagaagatcaaatcactttactcaaggcttgctcaagtgaa 1347 

NIIIIMIMIII Ml IIINIIMMN MNIMIIINN MINI 

ggtttcgataaacttgtaagagaagatcaaatcactttactcaaggcttgctcaagtgaa 900 
gctatgatgttcagggtagcaaggaagtatgacatcaccactgactcaatagtgtttgct 14 07 

:' 1 1 1 M M M M ' MM: Mill II I M I M 1 1 M : 1 1 1 ! 1 1 1 

gctatgatgttcagggtagcaaggaagtatgacatcaccactgactcaatagtgtttgct 960 
aacaaccagccattttcagctgattcatataacaaagctggattgggagatgccattgaa 1467 

t i 1 1 1 1 i 1 1 1 ! I I 1 1 1 1 1 1 1 1 1 1 i 1 1 ' i I 1 1 1 . 1 1 ! 1 1 1 1 

aacaaccagccattttcagctgattcatataacaaagctggattgggagatgccattgaa 1020 
aaccaactgtcattcagtcggtttatgtacaatatgaaggtggataacgcagaatatgcc 1527 

NIMIIIMI IIIIINIIINIINIIMI IIIIIIIIIIIIIMMI INN 

aaccaactgtcattcagtcggtttatgtacaatatgaaggtggataacgcagaatatgcc 1080 
ttattgaccgccatcgtcatattttcgagtaggccaaatttactagatggttggaaagtg 1587 

NIMiMINII lllllllil!IIIINIIIII Ml IMIIINII IN 

ttattgaccgccatcgtcatattttcgagtaggccaaatttactagatggttggaaagtg 114 0 
gagaaaatccaagaaatctacctagagtccttaaaagcttatgtagataatcgagaccgt 164 7 

MINMIIMMMMIIIMIINI NINII MMNNIMM 

gagaaaatccaagaaatctacctagagtccttaaaagcttatgtagataatcgagaccgt 1200 
gacacagcaactgtacgatatgcgcgacttctctcagtacttacagaattgcgcacatta 1707 

MINI MI IIMINMMIIIIIIMIM Ml IIMIIMNIM 



Db 1201 gacacagcaactgtacgatatgcgcgacttctctcagtacttacagaattgcgcacatta 1260 



Qy 


1708 


Db 


1261 


Qy 


1768 


Db 


1321 



ggcaatgaaaactctgagctatgtatgacactgaaactgaaaaacagagtagtaccccca 1767 

II MIIIIIMIII lllllllllllllll MINIM IIIMIII III 

ggcaatgaaaactctgagctatgtatgacactgaaactgaaaaacagagtagtaccccca 1320 
ttcttggccgaaatatgggatgtcatgcca 1797 

miMIIIIIIIIMIMMIIIIIIIII 

ttcttggccgaaatatgggatgtcatgcca 1350 



SUMMARIES 

% 

Result Query- 
No. Score Match Length DB ID Description 





1 


34 


1 


9 


343 


11 


BG262692 


BG262692 


WHE094 0_B 


c 


2 


31 


1 


7 


501 


10 


AV431489 


AV431489 


AV431489 


c 


3 


29 


1 


6 


342 


11 


BG263081 


BG263081 


WHE0946_H 


c 


4 


29 


1 


6 


498 


11 


BG262509 


BG262509 


WHE0936_G 




5 


29 


1 


6 


531 


10 


AU066588 


AU066588 


AU066588 




6 


29 


1 


6 


682 


13 


CNS01UUW 


AL168305 


Tetraodon 




7 


29 


1 


6 


865 


13 


CNS04VAZ 


AL308852 


Tetraodon 




8 


27 


1 


5 


449 


13 


L4393X 


AL354210 


Leishmani 


c 


9 


26 


1 


4 


203 


13 


AZ605406 


AZ605406 


1M0426A15 




10 


26 


1 


4 


527 


10 


AV6284 73 


AV628473 


AV62 84 73 


c 


11 


26 


1 


4 


828 


10 


AA986599 


AA986599 


uc81h02 .y 




12 


25 


1 


4 


263 


10 


AA409255 


AA409255 


EST01178 




13 


25 


1 


4 


362 


10 


AI597416 


AI597416 


vd86a08 .y 




14 


25 


1 


4 


364 


10 


AI020332 


AI020332 


ub27b01.r 




15 


25 


1 


4 


468 


13 


AQ848107 


AQ848107 


LMAJFV1_1 




16 


25 


1 


4 


494 


11 


BF450595 


BF450595 


uz68a05 .y 


c 


17 


25 


1 


4 


495 


10 


AW356009 


AW356009 


707018D07 


c 


18 


25 


1 


4 


550 


10 


AW927646 


AW927646 


945012H02 


c 


19 


25 


1 


4 


562 


10 


BE129837 


BE129837 


945030H09 




20 


25 


1 


4 


610 


10 


AW927887 


AW927887 


945010E03 


c 


21 


25 


1 


4 


616 


10 


AW129803 


AW129803 


707004E01 




22 


25 


1 


4 


637 


10 


AW289014 


AW289014 


707004E01 




23 


25 


1 


4 


655 


10 


BE310176 


BE310176 


601092815 




24 


25 


1 


4 


658 


10 


AV399810 


AV399810 


AV399810 




25 


25 


1 


4 


659 


11 


BF124507 


BF124507 


601761747 




26 


25 


1 


4 


696 


11 


BF784168 


BF784168 


602108030 




27 


25 


1 


4 


722 


10 


AI326449 


AI326449 


mm72a03 .x 




28 


25 


1 


4 


789 


11 


BI227188 


BI227188 


602949305 




29 


25 


1 


4 


792 


11 


BF166495 


BF166495 


601774990 




30 


25 


1 


4 


864 


10 


AV39954 3 


AV399543 


AV399543 




31 


25 


1 


4 


969 


11 


BF784421 


BF784421 


602110804 


c 


32 


25 


1 


4 


1191 


10 


BE300471 


BE300471 


600944055 




33 


24 


1 


3 


240 


10 


AV196856 


AV196856 


AV196856 


c 


34 


24 


1 


3 


289 


10 


AW028652 


AW028652 


wv33fl2 .x 




35 


24 


1 


3 


330 


10 


BE643572 


BE643572 


945041F03 




36 


24 


1 


3 


350 


13 


AQ903988 


AQ903988 


GSSTC0582 




37 


24 


1 


3 


360 


10 


BE050736 


BE050736 


za68hl2 .b 




38 


24 


1 


3 


360 


11 


C65115 


C65115 C65115 Yu j i 




39 


24 


1 


3 


375 


10 


AU164151 


AU164151 


AU164151 




40 


24 


1 


3 


378 


11 


BG943346 


BG943346 


ax36e09 .x 




41 


24 


1 


3 


403 


11 


BE994256 


BE994256 


UI-M-CGOp 




42 


24 


1 


3 


432 


11 


BF099352 


BF099352 


601751738 




43 


24 


1 


3 


488 


10 


BE051832 


BE051832 


za88f 12 .g 




44 


24 


1 


3 


505 


10 


BE393286 


BE393286 


601308304 




45 


24 


1 


3 


537 


10 


AV432941 


AV432941 


AV432941 



SEQ ID NO: 9 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


507.4 


28 


2 


2401 


3 


AF049136 


AF049136 Locusta m 


2 


460 .4 


25 


6 


2469 


3 


TMECDYREC 


Y11533 T.molitor m 


3 


382.6 


21 


3 


4562 


3 


AF034086 


AF034086 Celuca pu 


4 


341.2 


19 


0 


2355 


3 


AF325360 


AF325360 Calliphor 


5 


338.6 


18 


8 


3336 


3 


LCU75355 


U75355 Lucilia cup 


6 


315.4 


17 


6 


4130 


3 


AF210733 


AF210733 Aedes alb 


7 


312.2 


17 


4 


3875 


3 


AF020187 


AF02 018 7 Amblyomma 


8 


310.6 


17 


3 


1812 


3 


AF020186 


AF020186 Amblyomma 


9 


310.6 


17 


3 


2166 


3 


AF020188 


AF02 0188 Amblyomma 


10 


307.4 


17 


1 


4158 


3 


AAU02 021 


U02 021 Aedes aegyp 


11 


307.2 


17 


1 


2840 


3 


MSU19812 


U19812 Manduca sex 


12 


300.6 


16 


7 


3217 


3 


CCAJ4341 


AJ224341 Ceratitis 


13 


298 .2 


16 


6 


3962 


3 


CFU29531 


U2 9531 Choristoneu 


14 


298 .2 


16 


6 


6852 


3 


AF092030 


AF092030 Choriston 


15 


295 


16 


4 


5534 


3 


DROECR 


M74078 Drosophila 


16 


273.2 


15 


2 


1934 


6 


A59201 


A59201 Sequence 2 


17 


273.2 


15 


2 


2464 


6 


A59202 


A59202 Sequence 3 


18 


273.2 


15 


2 


2745 


3 


HVECR 


Y09009 H.virescens 


19 


273.2 


15 


2 


2745 


6 


A59203 


A59203 Sequence 4 


20 


272.6 


15 


2 


8363 


12 


AF098284 


AF098284 Cloning v 


21 


272.6 


15 


2 


11001 


12 


AF264696 


AF2 64 6 96 Cloning v 


22 


271.6 


15 


1 


2349 


3 


S60739 


S60739 cEcRH=ecdys 


23 


257.8 


14 


3 


2711 


3 


BMOECDYREC 


L35266 Bombyx mori 


24 


254 


14 


1 


2720 


3 


BMOBMECRB1 


D43943 Silkworm mR 


25 


212 .6 


11 


8 


1017 


3 


PCO251809 


AJ251809 Precis co 


26 


202.4 


11 


3 


1038 


3 


BAN251810 


AJ251810 Bicyclus 


27 


201 


11 


2 


1656 


6 


AR095098 


AR095098 Sequence 


28 


188 


10 


5 


948 


6 


A59205 


A59205 Sequence 6 


29 


171.4 


9 


5 


8743 


2 


AC019981 


AC019981 Drosophil 


30 


171.4 


9 


5 


135182 


3 


AC007121 


AC007121 Drosophil 


31 


171.4 


9 


5 


163990 


3 


AC009255 


AC009255 Drosophil 


32 


171.4 


9 


5 


180699 


3 


AC008339 


AC008339 Drosophil 


33 


171.4 


9 


5 


312373 


3 


AE003784 


AE003784 Drosophil 


34 


153.6 


8 


5 


2070 


10 


RNU18374 


U18374 Rattus norv 


35 


150.4 


8 


4 


1798 


10 


MMU09416 


U09416 Mus musculu 


36 


137. 8 


7 


7 


1456 


10 


MMU09418 


U09418 Mus musculu 


37 


137.8 


7 


7 


1555 


10 


MMU09417 


U09417 Mus musculu 


38 


136.2 


7 


6 


189 


3 


UPU31817 


U31817 Uca pugilat 


39 


129 . 8 


7 


2 


621 


3 


AF121910 


AF121910 Bradysia 


40 


128 .8 


7 


2 


2218 


9 


HSU68233 


U6 8233 Human fame 


41 


128.8 


7 


2 


2233 


6 


192592 


192592 Sequence 1 


42 


121 


6 


7 


546 


3 


AF239825 


AF23 982 5 Sarcophag 


43 


120.6 


6 


7 


183 


6 


E32705 


E32705 Novel nucle 


44 


115.4 


6 


4 


1431 


9 


AF384555 


AF384555 Homo sapi 


45 


110.4 


6 


1 


1524 


9 


BC008819 


BC0 08819 Homo sapi 






a. 








SUMMARIES 




3Ult 




-o 

Query- 










No. 


Score 


Match 


Length 


DB 


ID 


Description 


1 


1797 


100 


0 


1797 


22 


AAF32137 


M persicae ecdyson 


2 


1350 


75 


1 


1353 


20 


AAX90669 


M. persicae ecdyso 


3 


424 .8 


23 


6 


585 


20 


AAX90668 


M. persicae ecdyso 


4 


424.8 


23 


6 


585 


22 


AAF32134 


M persicae ecdyson 


5 


333.8 


18 


6 


2812 


19 


AAV6 03 02 


Lucilia cuprina ec 


6 


326.6 


18 


2 


2273 


20 


AAX90666 


cDNA encoding EcR 


7 


326.6 


18 


2 


2273 


22 


AAF32130 


L cuprina ecdysone 


8 


300 .2 


16 


7 


2126 


21 


AAA10312 


European corn bore 


9 


295 


16 


4 


2241 


18 


AAV06244 


Modified ecdysone 


10 


295 


16 


4 


2970 


14 


AAQ37556 


DHR23alpha cDNA cl 


11 


295 


16 


4 


5534 


12 


AAQ13574 


EcR gene . Drosoph 


12 


293.4 


16 


3 


2241 


21 


AAZ51469 


Vector pVgRXR enco 


13 


293 .4 


16 


3 


2295 


21 


AAZ51470 


Vector pVgRXR-5A/5 


14 


293 .4 


16 


3 


2301 


21 


AAZ51471 


Control vector pVg 


15 


290.8 


16 


2 


3126 


18 


AAV06245 


Modified ecdysone 


16 


290.2 


16 


1 


2241 


18 


AAV06243 


Modified ecdysone 


17 


273.2 


15 


2 


1934 


18 


AAT45793 


Heliothis ecdysone 


18 


273.2 


15 


2 


2464 


18 


AAT45794 


Heliothis ecdysone 



c 



c 



c 



c 



c 



c 



19 
20 
21 
22 
23 
24 
25 
26 
27 
28 
29 
30 
31 
32 
33 
34 
35 
36 
37 
38 
39 
40 
41 
42 
43 
44 
45 



150.4 
137.8 
128.8 
123.2 
123 .2 
123.2 
123.2 
123 .2 
123 .2 
121.4 
121.4 
121.4 
121.4 
121 .4 
121.4 
120.6 
115 .4 
110 .4 



273.2 
257. 8 
257.8 
257.8 



201 
201 
201 
201 
188 



15.2 
14 .3 
14.3 
14.3 
11.2 
11.2 
11.2 
11.2 
10.5 
8.4 
7.7 
7.2 
6.9 
6.9 
6.9 
6.9 
6.9 
6.9 
6.8 
6.8 
6.8 
6.8 
6.8 
6.8 
6.7 
6.4 
6.1 



2745 
1977 
2711 
9072 
1656 
1656 
1656 
1656 
948 
1787 
1677 
2233 
936 
936 
936 
936 
936 
938 
936 
936 
936 
936 
936 
938 
183 
2220 
1528 



18 
20 
20 
20 
19 
19 
21 
21 
18 
17 
17 
19 
22 
22 
22 
22 
22 
22 
22 
22 
22 
22 
22 
22 
20 
22 
21 



AAT4 5795 
AAX07557 
AAX07553 
AAX07558 
AAT86658 
AAV02768 
AAZ57062 
AAZ92154 
AAT45796 
AAT31928 
AAT31932 
AAV10120 
AAF58252 
AAF58254 
AAF58257 
AAF58259 
AAF58262 
AAF58255 
AAF58252 
AAF58254 
AAF58257 
AAF58259 
AAF58262 
AAF58255 
AAX59971 
AAD12786 
AAZ35043 



Heliothis ecdysone 
Synthetic modified 
Bombyx mori nuclea 
Synthetic modified 
Ecdysone receptor 
Drosophila ecdyson 
Drosophila ecdyson 
Ecdysone receptor 
Spodoptera ecdyson 
Retinoid X recepto 
Retinoid X recepto 
Human retinoid rec 
Oligonucleotide Dl 
Oligonucleotide Dl 
Oligonucleotide Dl 
Oligonucleotide D2 
Oligonucleotide D2 
Oligonucleotide Dl 
Oligonucleotide Dl 
Oligonucleotide Dl 
Oligonucleotide Dl 
Oligonucleotide D2 
Oligonucleotide D2 
Oligonucleotide Dl 
Swellfish AN023 pr 



Human nuclear horm 
Human nuclear orph 



ALIGNMENTS 



RESULT 1 
AAF32137 

ID AAF32137 standard; cDNA; 1797 BP. 
XX 

AC AAF3213 7; 
XX 

DT 10-APR-2001 (first entry) 
XX 

DE M persicae ecdysone receptor EcR subunit coding sequence SEQ ID NO: 13. 
XX 

KW Insect; steroid hormone receptor; juvenile hormone receptor; fly; 

KW EcR subunit; USP subunit; insecticide; ss. 

XX 

OS Myzus persicae. 
XX 

PN WO200102436-A1. 
XX 

PD ll-JAN-2001. 
XX 

PF 30-JUN-2000; 2000WO-AU00799 . 
XX 

PR 01-JUL-1999; 99US-0346470 . 
XX 

PA (CSIR ) COMMONWEALTH SCI & IND RES ORG. 
XX 

PI Hill RJ, Hannan GN; 
XX 

DR WPI; 2001-080981/09. 
DR P-PSDB; AAB67101 . 
XX 

PT New nucleic acid molecule for the regulation of gene expression in 

PT insects - 

XX 

PS Claim 23; Page 13 8-141; 172pp; English. 
XX 

CC The present invention provides the protein and coding sequences of the 
CC Bemisia tabaci {Silverleaf whitef ly) , Myzus persicae (aphid) and Lucilia 
CC cuprina (sheep blowfly) steroid and juvenile hormone steroid receptor 
CC subunits EcR and USP. These can be used to screen for agents with 



CC insecticidal activity. 
XX 

SQ Sequence 1797 BP; 512 A; 444 C; 459 G; 382 T; 0 other; 



Query Match 100.0%; Score 1797; DB 22; Length 1797; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 1797; Conservative 0; Mismatches 0; Indels 0; Gaps 
Qy 1 atgatggaccagaaatgtgacgtcggcggtggtggtgtcgctgctgccgccgccggtatc 6 0 

IIIIIIIIMIII IIIMilllllMI IIIMIirill! IMIIIIMIII 

Db 1 atgatggaccagaaatgtgacgtcggcggtggtggtgtcgctgctgccgccgccggtatc 60 

Qy 61 ggtggcggcggtgtcggcggcctcatgtcgtacaaccgtggccgtggcggcaccgaggtc 120 

Ml : 'MM MM 1 : 1 II 1 1 1! I. Jill MIL ,1 II, II I II III I : 'II 1 1 1 II I 

Db 61 ggtggcggcggtgtcggcggcctcatgtcgtacaaccgtggccgtggcggcaccgaggtc 120 

Qy 121 atcatcaaaccccgtagtcctgccgtggtgcaggtggccaccggtggcagttaccacggc 180 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 F 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M I J 1 1 

Db 121 atcatcaaaccccgtagtcctgccgtggtgcaggtggccaccggtggcagttaccacggc 180 
Qy 181 ctgccggcggcctccgacgccgtcatcgtgcgcagcccgccaggcggccacttgcccggg 24 0 

! 1 1 1 1 1 1 II I E 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 II M 1 1 1 1 1 M 1 1 M M 1 1 1 1 J 1 1 1 i 1 1 1 1 M 

Db 181 ctgccggcggcctccgacgccgtcatcgtgcgcagcccgccaggcggccacttgcccggg 24 0 
Qy 241 ccgcagcagcaagtgccgccgtcccgcaacggctgttccaccctgtttagcgacatcgct 300 

Ml IIIIIIIMMIMIMI llllllll III . IMIIIMIIIII III 

Db 241 ccgcagcagcaagtgccgccgtcccgcaacggctgttccaccctgtttagcgacatcgct 300 
Qy 301 ggcgtcaagcgactcaggcccgacgattggttggccgtcaactcgccgcccgcctcttcg 360 

MIIMIIIIIIIIIIIIIIIIIIIMIIIIIIIIMIIIMIIIIIIIIIIIIIMIII 

Db 3 01 ggcgtcaagcgactcaggcccgacgattggttggccgtcaactcgccgcccgcctcttcg 360 
Qy 3 61 cccggcacgtcgcacatatcctacacagtcatatcgaacggcggcggcggtggcggcggt 420 

IIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIMIIIMIIIIIIIIIIIIIIIIIII 

Db 361 cccggcacgtcgcacatatcctacacagtcatatcgaacggcggcggcggtggcggcggt 42 0 
Qy 421 ggcggcggtggttacaacacgtctccaatgtcgaccaacagctacgacccgtacagtccg 4 80 

1 1 1 1 1 . 1 1 j iiiiiiii 1 1 1 1 1 j i . : iii.ii 1 1 1 1 : 

Db 421 ggcggcggtggttacaacacgtctccaatgtcgaccaacagctacgacccgtacagtccg 480 

Qy 4 81 atgagtggaaaaatcgtcaaagaagagttgtctccgccaaacagcctgtcgggagtcagc 54 0 

I 'II 1 1 1 : I I I I 

Db 4 81 atgagtggaaaaatcgtcaaagaagagttgtctccgccaaacagcctgtcgggagtcagc 54 0 

Qy 541 agccattcggatgggttgaagaagaagaaactcaaccacacgccctcgaccggtgtcgtc 600 

IMIIIIIIMIIIIMIIIIIIIIIIIIMIIIIMIIIMIIIIIIIIIIIIIIIIII 

Db 541 agccattcggatgggttgaagaagaagaaactcaaccacacgccctcgaccggtgtcgtc 600 
Qy 6 01 aacacctcggcatcgggccccgggggtggcgttggtggcaatgtgctgaacaaccgacct 660 

IIMIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMMIIIIIIIIIIII 

Db 601 aacacctcggcatcgggccccgggggtggcgttggtggcaatgtgctgaacaaccgacct 660 
Qy 661 cccgaagagctgtgcctggtgtgtggcgaccggtcgtccggttaccattacaacgctctc 720 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIMII 

Db 661 cccgaagagctgtgcctggtgtgtggcgaccggtcgtccggttaccattacaacgctctc 72 0 
Qy 721 acatgcgaaggatgcaaggggttcttccggaggagcatcaccaagaacgccgtgtaccag 780 

IIIIMIIIIIIIIIIIIIIIIIIIIIIIMIIIMIIIIIMIIIIIIIIIIIIIIIII 

Db 721 acatgcgaaggatgcaaggggttcttccggaggagcatcaccaagaacgccgtgtaccag 780 
Qy 781 tgcaagtacggcaacaattgcgaaatcgacatgtacatgaggcggaagtgccaggagtgc 84 0 

1 1 1 ! 1 1 1 1 1 _ i 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 ! : 1 1 1 1 ill 1 1 1 1 1 1 1 1 1 1 J 1 1 

Db 781 tgcaagtacggcaacaattgcgaaatcgacatgtacatgaggcggaagtgccaggagtgc 84 0 
Qy 841 cggctgaaaaaatgcctgaccgtcggcatgaggcctgaatgtgttgtacctgaagttcaa 900 

IIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIMIIIIIIIIIIIMIIII 

Db 841 cggctgaaaaaatgcctgaccgtcggcatgaggcctgaatgtgttgtacctgaagttcaa 900 
Qy 901 tgcgcagtaaaaagaaaggagaaaaaagctcaacgagaaaaagataaaccaaattctact 960 

IIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 



Db 


901 


Qy 


961 


Db 


961 


Qy 


1021 


Db 


1021 


Qy 


1081 


Db 


1081 


Qy 


1141 


Db 


1141 


Qy 


1201 


Db 


1201 


Qy 


1261 


Db 


1261 


Qy 


1321 


Db 


1321 


Qy 


1381 


Db 


1381 


Qy 


1441 


Db 


1441 


Qy 


1501 


Db 


1501 


Qy 


1561 


Db 


1561 


Qy 


1621 


Db 


1621 


Qy 


1681 


Db 


1681 


Qy 


1741 


Db 


1741 



901 tgcgcagtaaaaagaaaggagaaaaaagctcaacgagaaaaagataaaccaaattctact 960 
acagacatttctcctgaaataataaaaatagaacctacagagatgaagattgaatgtggt 1020 

i 1 1 i I I i I i 1 1 1 1 1 1 1 ! 1 1 1 1 ! 1 1 : 1 1 j 1 1 i 1 1 i : 1 f 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 ' 1 1 E E I 

acagacatttctcctgaaataataaaaatagaacctacagagatgaagattgaatgtggt 102 0 
gaaccaatgataatgggcacacctatgccgactgtaccttacgtgaaacctttgagttct 1080 

i J 1 4 ' i 1 1 j i I J I 1 1 i I 1 1 1 1 1 1 1 1 : i ! 1 1 1 1 1 j . 1 1 1 1 1 1 1 F 1 1 1 i i i . 

gaaccaatgataatgggcacacctatgccgactgtaccttacgtgaaacctttgagttct 1080 
gaacaaaaagaactgatccaccgacttgtctatttccaggatcaatatgaagctcctagt 114 0 

MM II! MIMIMMMMMIII IMMMIMMIM MMMMMIM 

gaacaaaaagaactgatccaccgacttgtctatttccaggatcaatatgaagctcctagt 1140 
gaaaaagacatgaaacgtttaacaataaataatcaaaatatggatgaatatgatgaagaa 1200 

IMMIIIM 1 1 INMM II I Ml III II 1 1 MIIIM II Mill I II II Ml I II 

gaaaaagacatgaaacgtttaacaataaataatcaaaatatggatgaatatgatgaagaa 1200 
aaacaaagtgacaccacatatcgaatcatcactgagatgacaatactcacagttcaactg 1260 

IIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIimillllMIIIIIIMIMIII 

aaacaaagtgacaccacatatcgaatcatcactgagatgacaatactcacagttcaactg 1260 
attgttgagtttgccaaacgattaccaggtttcgataaacttgtaagagaagatcaaatc 1320 

IMMIIII MM MM MMMMMMMMIIMIMN IMIMI MM 

attgttgagtttgccaaacgattaccaggtttcgataaacttgtaagagaagatcaaatc 1320 
actttactcaaggcttgctcaagtgaagctatgatgttcagggtagcaaggaagtatgac 1380 

MM III MM IMMMIIMMI MIIMM III MM > MM II 

actttactcaaggcttgctcaagtgaagctatgatgttcagggtagcaaggaagtatgac 13 80 
atcaccactgactcaatagtgtttgctaacaaccagccattttcagctgattcatataac 144 0 

IMMIIIII IMMIII IIMMIIIIIIIMIIIIIIIIl IIIMM MM 

atcaccactgactcaatagtgtttgctaacaaccagccattttcagctgattcatataac 144 0 
aaagctggattgggagatgccattgaaaaccaactgtcattcagtcggtttatgtacaat 1500 

IMMIII Ml MM MIMIIII; IIIIIMM IIMIIM: Ml II 

aaagctggattgggagatgccattgaaaaccaactgtcattcagtcggtttatgtacaat 1500 
atgaaggtggataacgcagaatatgccttattgaccgccatcgtcatattttcgagtagg 1560 

1 1 1 1 II 1 1 1 II 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 i 1 1 II 1 1 1 1 ! 1 1 1 II 1 1 1 1 1 1 1 1 1 

atgaaggtggataacgcagaatatgccttattgaccgccatcgtcatattttcgagtagg 1560 
ccaaatttactagatggttggaaagtggagaaaatccaagaaatctacctagagtcctta 1620 

i i ii ill- mi. 

ccaaatttactagatggttggaaagtggagaaaatccaagaaatctacctagagtcctta 1620 
aaagcttatgtagataatcgagaccgtgacacagcaactgtacgatatgcgcgacttctc 1680 

IM III MM MM MIMMMMMIMMIM MIMMMMMIMI 

aaagcttatgtagataatcgagaccgtgacacagcaactgtacgatatgcgcgacttctc 1680 
tcagtacttacagaattgcgcacattaggcaatgaaaactctgagctatgtatgacactg 174 0 

IM: IMMIIII IMIIIIMII ! Ill II MM III llll I 

tcagtacttacagaattgcgcacattaggcaatgaaaactctgagctatgtatgacactg 174 0 
aaactgaaaaacagagtagtacccccattcttggccgaaatatgggatgtcatgcca 1797 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIII 

aaactgaaaaacagagtagtacccccattcttggccgaaatatgggatgtcatgcca 1797 



SUMMARIES 

% 

Result Query- 
No. Score Match Length DB ID Description 



1 


171.4 


9 


5 


501 


10 


AI514995 


AI514995 


LD46649.5 


2 


171.4 


9 


5 


595 


10 


AI544194 


AI544194 


LD4 764 7 . 5 


3 


150 


8 


3 


917 


10 


AI987611 


AI987611 


ul86d09.y 


4 


144 . 6 


8 


0 


523 


10 


AI099367 


AI099367 


ue30bl2 .x 


5 


140 


7 


8 


851 


13 


CNS011ME 


AL100448 


Drosophil 


6 


137.8 


7 


7 


1728 


12 


AK002513 


AK002513 


Mus muscu 



7 


135 


.8 


7 


.6 


672 


10 


AI117017 


AI117017 


ue30bl2 .y 


8 


131 


.6 


7 


.3 


975 


11 


BF787802 


BF787802 


602113568 


9 


131 


7 


.3 


673 


11 


BF384905 


BF384905 


602045211 


10 


130 


7 


.2 


868 


11 


BI330645 


BI330645 


602981090 


11 


123 


.4 


6 


.9 


574 


10 


AI258616 


AI258616 


LP01848.5 


12 


118 


.8 


6 


.6 


652 


11 


BG710420 


BG710420 


pglln.pkO 


13 


115 


.4 


6 


.4 


756 


11 


BG171220 


BG171220 


602321423 


14 


112 


6 


.2 


346 


10 


AW483842 


AW483842 


56794 MAR 


C 15 


110 


.4 


6 


.1 


578 


10 


AI831552 


AI831552 


wj09dl0 .x 


16 


110 


.4 


6 


.1 


891 


10 


AL558946 


AL558946 


AL558946 


17 


110 


.4 


6 


.1 


1024 


10 


BE542999 


BE542999 


601068920 


18 


108 


.8 


6 


.1 


612 


11 


BF385640 


BF385640 


602045890 


19 


108 


.8 


6 


1 


674 


11 


BF232668 


BF232668 


602023254 


20 


108 


.8 


6 


1 


958 


11 


BF235121 


BF235121 


602027886 


21 


108 


.6 


6 


0 


640 


11 


BG423659 


BG423659 


602449571 


22 


107 


.8 


6 


0 


470 


10 


AA538642 


AA538642 


LD18219 . 5 


23 


107 


.2 


6 


0 


611 


11 


BG710718 


BG710718 


pglln.pkO 


24 


105 


.6 


5 


9 


358 


10 


AW431214 


AW431214 


71543 MAR 


25 


105 


.6 


5 


9 


563 


10 


BE233303 


BE233303 


139430 MA 


26 


104 


4 


5 


8 


277 


10 


BE755918 


BE755918 


209807 MA 


27 


103 


6 


5 


8 


820 


10 


AL524426 


AL524426 


AL524426 


28 


103 


2 


5 


7 


878 


10 


AL555760 


AL555760 


AL555760 


29 


103 


2 


5 


7 


895 


10 


AL531027 


AL531027 


AL531027 


30 


101 


2 


5 


6 


672 


13 


AZ897361 


A2897361 


RPCI-24-2 


31 


100 


4 


5 


6 


818 


13 


CNSO2CM0 


AL191313 


Tetraodon 


32 


99 


8 


5 


6 


1036 


13 


CNS03YPQ 


AL266615 


Tetraodon 


33 


99 


6 


5 


5 


744 


11 


BG756857 


BG756857 


602710341 


34 


99 


6 


5 


5 


870 


10 


AL516240 


AL516240 


AL516240 


35 


99 


6 


5 


5 


902 


10 


AL521072 


AL521072 


AL521072 


36 


98 


4 


5 


5 


861 


11 


BI261066 


BI261066 


602972272 


37 


98 


2 


5 


5 


965 


11 


BE902332 


BE902332 


601675628 


38 


98 


5 


5 


718 


11 


BF569478 


BF569478 


602185964 


39 


97 


6 


5 


4 


717 


10 


AU135617 


AU135617 


AU135617 


40 


97 


4 


5. 


4 


554 


11 


BI149124 


BI149124 


602913931 


41 


96 


5. 


3 


765 


11 


BI146133 


BI146133 


602910928 


42 


96 


5. 


3 


962 


13 


CNS02JSQ 


AL200627 


Tetraodon 


43 


95. 


4 


5. 


3 


841 


10 


AL559175 


' AL559175 


AL559175 


44 


94 . 


8 


5. 


3 


643 


10 


AV655930 


AV655930 


AV655930 


c 45 


94 . 


6 


5. 


3 


832 


13 


CNS013QT 


AL103199 


Drosophil 



Sequence Comparison A 

RESULT 2 
002035 

ID 002 035 PRELIMINARY; PRT; 4 91 AA. 

AC 002035; 

DT 01-JUL-1997 (TrEMBLrel . 04, Created) 

DT 01-JUL-1997 (TrEMBLrel. 04, Last sequence update) 

DT 01-JUN-2001 (TrEMBLrel . 17, Last annotation update) 

DE ECDYSONE RECEPTOR. 

GN ECR . 

OS Tenebrio molitor (Yellow mealworm) . 

OC Eukaryota; Metazoa; Arthropoda; Tracheata; Hexapoda; Insecta; 

OC Pterygota; Neoptera; Endopterygota; Coleoptera; Polyphaga; 

OC Cucuj if ormia; Tenebrionidae ; Tenebrio. 

OX NCBI_TaxID=7067; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=98000286; PubMed=93422 39 ; 

RA Mouillet J.F., Delbecque J. P., Quennedey B., Delachambre J.; 

RT "Cloning of two putative ecdysteroid receptor isoforms from Tenebrio 

RT molitor and their developmental expression in the epidermis during 

RT metamorphosis . " ; 

RL Eur. J. Biochem. 248:856-863(1997). 

CC -!- SUBCELLULAR LOCATION: NUCLEAR (BY SIMILARITY). 

CC SIMILARITY: TO C4-TYPE STEROID RECEPTOR ZINC FINGER FAMILY. 

DR EMBL; Y11533; CAA72296.1; -. 

DR HSSP; P2 03 93; 1A6Y, 

DR Inter Pro; I PRO 0053 6; Hormone_rec__lig . 

DR Inter Pro; IPR000504; RRM. 

DR InterPro; IPR001723; Strdhormone_receptor . 

DR InterPro; IPR001628; zf-C4. 

DR Pfam; PF00104; hormone_rec; 1. 

DR Pfam; PF00105; zf-C4; 1. 

DR PRINTS; PRO 03 98; STRDHORMONER . 

DR PRINTS; PRO 004 7; STROIDFINGER . 

DR SMART; SM00430; HOLI; 1. 

DR SMART; SM00399; ZnF_C4 ; 1. 

DR PROSITE; PS00031; NUCLEAR_RECE PTOR ; 1. 

DR PROSITE; PS00030; RRM_RNP_1 ; UNKNOWN_l . 

KW DNA-binding; Nuclear protein; Receptor; Transcription regulation; 

KW Zinc- finger. 

SQ SEQUENCE 491 AA; 56096 MW; F997E90A56A789D1 CRC64 ; 



Query Match 45.5%; Score 1434.5; DB 5; Length 491; 

Best Local Similarity 65.8%; Pred. No. 8.4e-99; 

293; Conservative 48; Mismatches 51; Indels 53; Gaps 

SPMSGKIVKEELSPPNSLSGVSSHSDGLKKKKLNHTPSTGWNTSASGPGGGVGGNVLNN 218 
I III :hll Mhl N I I I I I || 

SSMSG- - -REDLS-PSSLNGYSADSCDSKKKK GP TP 121 

RPPEELCLVCGDRSSGYHYNALTCEGCKGFFRRSITKNAVYQCKYGNNCEIDMYMRRKCQ 278 

I MMMMIhlMMIMMMIIMMMIIIIIIMMIMIIIMMMMM 

RQQEELCLVCGDRASGYHYNALTCEGCKGFFRRSITKNAVYQCKYGNNCEIDMYMRRKCQ 181 

E C RL KKCLT VGMR PE C W P E VQC A VKRKE KKAQRE KD K PNS T TD I S PEIIKIEP TE 334 

lllllllhlllllllllMIIMIIMIIIIhlllllllll: ||::||||| :| 
ECRLKKCLSVGMRPECWPEVQCAVKRKEKKAQKEKDKPNSTTNGSPDVIKIEPELSDSE 241 

MKIECGEPMIMGTPMPTVPYVKPLSSEQKE- -LIHRLVYFQDQYEAPSEKDMKRLTINNQ 392 

: I I I Ihl MMMMhMI | | i : , : | ■ | | 

KTLTNGRNRI SPEQEELILIHRLVYFQNEYEHPSEEDVKR- - IINQ 285 

NMDEYDEEKQSDTTYRIITEMTILTVQLIVEFAKRLPGFDKLVREDQITLLKACSSEAMM 452 

: I I I : : I IMMMIIMMMIMMMIhMIM MINIM M 

PI DGEDQCEIRFRHTTEITILTVQLIVEFAKRLPGFDKLLQEDQIALLKACSSEVMM 342 



Matches 


Qy 


159 


Db 


90 


Qy 


219 


Db 


122 


Qy 


279 


Db 


182 


Qy 


335 


Db 


242 


Qy 


393 


Db 


286 



Qy 453 FRVARKYDITTDSIVFANNQPFSADSYNKAGLGDAIENQLSFSRFMYNMKVDNAEYALLT 512 

IMhlh =111 = 1 1111= Mil Ihh I I • I I I IhllllllllMII 

Db 343 FRMARRYDVQSDS I LFVNNQPYPRDS YNLAGMGETI EDLLHFCRTMYSMKVDNAEYALLT 4 02 

Qy 513 AIVIFSSR PNLLDG WKVE K IQEIYLESL KA YVDNRDRDT AT VR YARL L S VLTE LRTLGNE 5 72 

HUM lhh:|MIMIIIII||:|:|||||| : = | : | || | | | | | | | | | | : 

Db 403 AIVIFSERPSLIEGWKVEKIQEIYLEALRAYVDNRRSPSRGTIFAKLLSVLTELRTLGNQ 462 

Qy 573 NSELCMTLKLKNRWPPFLAEIWDV 597 

M I = I = = I I f I I = :|||| UNI 
Db 463 NSEMCISLKLKNKKLPPFLDEIWDV 487 



Sequence Comparison B 



RESULT 2 

TMECDYREC 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 



JOURNAL 
MEDLINE 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 



gene 
CDS 



BASE COUNT 
ORIGIN 



TMECDYREC 2469 bp mRNA INV 14-OCT-1997 

T.molitor mRNA for ecdysone receptor. 

Y11533 

Y11533.1 GI:2155007 
ecdysone receptor; EcR gene, 
yellow mealworm. 
Tenebrio molitor 

Eukaryota; Metazoa; Arthropoda; Tracheata; Hexapoda; Insect a; 
Pterygota; Neoptera; Endopterygota; Coleoptera; Polyphaga; 
Cucujiformia; Tenebrionidae ; Tenebrio. 

1 (bases 1 to 2469) 

Mouillet, J.F. , Delbecque, J. P. , Quennedey,B. and Delachambre, J . 
Cloning of two putative ecdysteroid receptor isoforms from Tenebrio 
molitor and their developmental expression in the epidermis during 
metamorphosis 

Eur. J. Biochem. 248 (3), 856-863 (1997) 
98000286 

2 {bases 1 to 2469) 
Mouillet, J.F. 
Direct Submission 

Submitted (27-FEB-1997) J.F. Mouillet, Lab Zoologie-CNRS UMR 5548, 
Faculte des Sciences Gabriel, 6BD Gabriel, 21000 Dijon, France 

Location/Qualifiers 

1. .2469 

/ organi sm= " Tenebrio mol it or " 

/db_xref="taxon: 7067" 

838. .2313 

/gene="EcR" 

838. .2313 

/gene="EcR n 

/codon_start=l 

/product =" ecdysone receptor" 
/protein_id="CAA72296 .1" 
/db_xref ="GI =2155008" 
/db_xre f = " S PTREMBL : 00 2 0 3 5 " 

/ 1 ran s 1 a t i on= " MKRRW SG LQ A VR VT PE E S S S E VT S S S TTL VMS PAN S LAS TD I GD 
VDLEFWDLDLNGAKSRQRTHYHSKRIWIPGHTIIASNHHLAKSDTSSMSGREDLSPSS 
LNGYSADSCDSKKKKGPTPRQQEELCLVCGDRASGYHYNALTCEGCKGFFRRSITKNA 
VYQCKYGNNCEIDMYMRRKCQECRLKKCLSVGMRPECWPEVQCAVKRKEKKAQKEKD 
KPNSTTNGSPDVIKIEPELSDSEKTLTNGRNRISPEQEELILIHRLVYFQNEYEHPSE 
EDVKRIINQPIDGEDQCEIRFRHTTEITILTVQLIVEFAKRLPGFDKLLQEDQIALLK 
AC S S E VMMFRMARR YD VQ S D S I L F VNNQ P Y PRDS YNLAGMG E T I EDL LH FCRTM YS MK 
VDNAE YALLT A IVIFSERPSLIE GWKVE KI QE I YLE ALRAYVDNRRS P S RGT I FAKLL 
SVLTELRTLGNQNSEMCISLKLKNKKLPPFLDEIWDVDLKA" 
682 a 589 c 642 g 556 t 



Query Match 25.6%; Score 460.4; DB 3; 

Best Local Similarity 64.4%; Pred. No. le-79; 
Matches 74 2; Conservative 0; Mismatches 371; 



Length 2469; 
Indels 39; Gaps 



Qy 

Db 

Qy 
Db 

Qy 
Db 



64 0 aatgtgctgaacaaccgacctcccgaagagctgtgcctggtgtgtggcgaccggtcgtcc 699 

IN I II INI I Mill I II 1 1 1 1 1 1 II Ml I IMM 

1186 AAAGGTCCGACGCCTCGACAACAGGAAGAATTATGTTTGGTGTGCGGAGACAGAGCGTCC 124 5 
700 ggttaccattacaacgctctcacatgcgaaggatgcaaggggttcttccggaggagcatc 759 

IMM MM Ml I II Mill II Mill II II MM I III || 

1246 GGTTACCATTACAACGCATTGACTTGCGAGGGTTGCAAAGGTTTCTTCAGACGCAGTATA 1305 
760 accaagaacgccgtgtaccagtgcaagtacggcaacaattgcgaaatcgacatgtacata 819 

„ n<t i'l I'' I 'I I II Mill Mil I IIIIIMI III 

13 06 ACAAAGAACGCTGTGTATCAATGCAAATACGGAAACAATTGCGAAATAGACATGTATATG 1365 



Qy 820 aggcggaagtgccaggagtgccggctgaaaaaatgcctgaccgtcggcatgaggcctqaa 879 
nK „ Ill II IMIIM I I I I I I I I I I I | | || | | M| | T|| 

Db 1366 AGACGAAAGTGTCAAGAGTGCCGGTTGAAAAAGTGCCTCAGCGTCGGTATGAGGCCGGAA 14 2 5 

Qy 880 tgtgttgtacctgaagttcaatgcgcagtaaaaagaaaggagaaaaaagctcaacgagaa 939 
I'll! HIM 1 1 1 1 1 II M M M II Mill Mill Mill Ml III 

Db 1426 TGTGTGGTACCGGAAGTACAGTGTGCTGTTAAGAGAAAAGAGAAGAAAGCCCAAAAGGAA 1485 

Qy 940 aaagataaaccaaattctactacagacatttctcctgaaataataaaaatagaacctaca 999 
IIHIIIIIIIIII Mill II II II II I II Mill Mill I 

Db I4 86 AAAG AT AAAC C AAAC AG C ACTACT AACGGCTC ACC AG ACGTC ATCAAAAT TG AAC C AG AA 154 5 

Qy 1000 gagatgaagattgaatgtggtgaaccaatgataatgggcacacctatgccgactgtacct 1059 

1 I I" || I I I I I III 

Db 1546 TTGTCAGATTCAGAA AAAACATTGACTAACGGACGC 1581 

Qy 1060 tacgtgaaacctttgagttctgaacaaaaagaactgatccaccgacttgtctatttccag 1119 
Db 1582 AATAGG AT AT C AC C AG AG CAAG AGG AGCT C AT ACT C AT AC ATC^ 1641 
Qy 1120 gatcaatatgaagctcctagtgaaaaagacatgaaacgtttaacaataaataatcaaaat 1179 

nK _ I M INN I Ml MM Mill I Mill II I I I 

Db 1642 AACGAATATGAACATCCGTCTGAAGAAGACGTTAAACGGATTATCAATCAGCCGATA 1698 

Qy 1180 atggatgaatatgatgaagaaaaacaaagtgacaccacatatcgaatcatcactqaqatq 1239 

nh koo 1111 111 111 1111 1 IN I III II II 

1699 GATGGTGAAGATCAGTGTGAGATACGGTTTAGGCATACCACGGAAATT 1746 

Qy 1240 acaatactcacagttcaactgattgttgagtttgccaaacgattaccaggtttcgataaa 1299 

11 'I M II II II Mill II II II MM III II IMIIM I II I III M 

Db 1747 ACGATCCTGACTGTGCAGCTGATCGTGGAGTTTGCCAAGCGGTT^ 180 6 
Qy 1300 cttgtaagagaagatcaaatcactttactcaaggcttgctcaagtgaagctatgatgttc 1359 

II 1 MIMMMM Ml I Mill M Mill MM II I MM 1 1 

Db 1807 C T C CTGC AGGAAG ATCAAATTG CT C T CT TG AAGG C ATGTTC AAG CG AAGTGATG ATGTT C 1866 

Qy 1360 agggtagcaaggaagtatgacatcaccactgactcaatagtgtttgctaacaaccagcca 1419 

III 1 11 I II Ml M IMIIM I II I MM IMIIM 

Db 1867 AGGATGGCCCGACGTTACGACGTCCAGTCGGATTCCATCCTCTTCGTAAACAACCAG^ 1926 

Qy 1420 ttttcagctgattcatataacaaagctggattgggagatgccattgaaaaccaactgtca 1479 

111 M MM MM MM II II I I II I I I II 
Db 1927 TATCCGAGGGACAGTTACAATTTGGCCGGTATGGGGGAAACCATCGAAGATCTCTTGCAT 1986 

Qy 1480 ttcagtcggtttatgtacaatatgaaggtggataacgcagaatatgccttattgaccgcc 1539 

nh 100 M I I MIMM 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 II 1 1 1 1 1 1 1 1 Ml I M III 

Db 1987 TTTTGCAGAACTATGTACTCCATGAAGGTGGATAATGCCG^ 2046 

Qy 1540 atcgtcatattttcgagtaggccaaatttactaga 1599 

Db 2047 ATCGTTATTTTCTCAGAGCGACCGTCGTTGATAGAAGGCTG^ 2106 

Qy 1600 gaaatctacctagagtw 1659 

I MM Ml INN I II MM II II M III I I 

Db 2107 GAAATCTATTTAGAGGCATTGCGGGCGTACGTCGACAACCGAAGAAGCCCAAGCCGGGGC 2166 
Qy 1660 gtacgatatgcgcgacttctctcagtacttacagaattgcgcacattaggcaa 1719 

I " 20 frf?rrfr?rrf??r;?mTTMT" 9 M?rrrff?"frr?M 1,79 

Db 2227 TCAGAGATGTGCATCTCGTTGAAATTGAAAAAOiAAAAGTTACCGCCGTTCCTCGACG^ 2286 
Qy 1780 atatgggatgtc 1791 

II lllll III 

Db 2287 ATCTGGGACGTC 2298 



